Showing search results (1–1 of 1):

  1. UniRef
    Full Name of the Resource : Clustered sets of related sequences from UniProt
    Resource Category : Databases -> Protein Sequence Databases -> General Protein Sequence Databases

    Brief Description : The UniProt Reference Clusters (UniRef) are three separate datasets that compress sequence space at different resolutions by merging sequences and sub-sequences that are 100% (UniRef100), >=90% (UniRef90), or >=50% (UniRef50) identical, regardless of source organism. UniRef100 provides the most comprehensive non-redundant coverage of the known protein sequence space: it includes all of UniProtKB, splice variants that are not separated out in UniProtKB, and additional active sequences from UniParc. UniRef90 and UniRef50 provide a more even sampling of sequences by reducing the number of closely related sequences, which speeds up sequence similarity searches and makes their results more informative. Compressing UniRef100 into UniRef90 and UniRef50 yields size reductions of approximately 40% and 65%, respectively.
    Subject Area : Universal Protein Resource


    Institute/s :
    Protein Information Resource, Department of Biochemistry and Molecular & Cellular Biology, Georgetown University Medical Center, Washington, DC USA.
    Address of Institute/s :
    Protein Information Resource, Department of Biochemistry and Molecular & Cellular Biology, Georgetown University Medical Center, Washington, DC 20007, USA.
    Country : United States

    Associated Institutes :

    • Department of Biochemistry and Molecular Biology, Georgetown University Medical Center, 3900 Reservoir Road, NW, Washington, DC 20057-1414, USA
    • The EMBL Outstation, The European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
    • Swiss Institute of Bioinformatics, Centre Medical Universitaire, 1 rue Michel Servet, 1211 Geneva 4, Switzerland
    • National Biomedical Research Foundation, 3900 Reservoir Road, NW, Washington, DC 20057-1414, USA

    Associated Country : USA; United Kingdom; Switzerland


    Authors/Contributors : Cathy H. Wu; Rolf Apweiler; Amos Bairoch; Darren A. Natale; Winona C. Barker; Brigitte Boeckmann; Serenella Ferro; Elisabeth Gasteiger; Hongzhan Huang; Rodrigo Lopez; Michele Magrane; Maria J. Martin; Raja Mazumder; Claire O'Donovan; Nicole Redaschi; Baris Suzek
    Contact Email : bes23@georgetown.edu
    Year : 2005
    Language : English

    Keywords : Amino Acid Sequence; Animals; Computational Biology; Databases, Protein; Humans; Information Storage and Retrieval; Proteins/chemistry
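
    Illustrative sketch : The Brief Description says that the three UniRef sets merge sequences at 100%, >=90%, and >=50% identity. The toy Python example below shows how lowering an identity threshold reduces the number of clusters. It is a sketch under stated assumptions, not the UniRef production pipeline: the function names `identity` and `greedy_cluster`, the greedy first-match strategy, the position-wise identity measure for equal-length sequences, and the example sequences are all hypothetical and chosen only for illustration. Sub-sequence merging, which UniRef also performs, is not modelled.

```python
from typing import Dict, List


def identity(a: str, b: str) -> float:
    """Toy identity: fraction of matching positions.

    Assumption: sequences are pre-aligned and of equal length;
    unequal lengths are treated as 0% identical.
    """
    if len(a) != len(b) or not a:
        return 0.0
    matches = sum(x == y for x, y in zip(a, b))
    return matches / len(a)


def greedy_cluster(seqs: List[str], threshold: float) -> Dict[str, List[str]]:
    """Assign each sequence to the first representative it matches
    at or above `threshold`; otherwise it starts a new cluster."""
    clusters: Dict[str, List[str]] = {}  # representative -> members
    for seq in seqs:
        for rep in clusters:
            if identity(seq, rep) >= threshold:
                clusters[rep].append(seq)
                break
        else:
            clusters[seq] = [seq]
    return clusters


# Hypothetical sequences, invented for this example.
seqs = [
    "MKTAYIAKQR",  # first representative
    "MKTAYIAKQR",  # exact duplicate of the first
    "MKTAYIAKQK",  # 9 of 10 positions match the first
    "MKTAYLLKQK",  # 7 of 10 positions match the first
    "GGSWPLVNDE",  # unrelated to the first
]

# Thresholds mirror the UniRef100 / UniRef90 / UniRef50 levels.
for name, threshold in [("100%", 1.0), ("90%", 0.9), ("50%", 0.5)]:
    n_clusters = len(greedy_cluster(seqs, threshold))
    print(f"{name} identity: {n_clusters} clusters from {len(seqs)} sequences")
```

    On this toy input the cluster count falls from 4 to 3 to 2 as the threshold drops, the same direction of effect as the compression the Brief Description reports for real data.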