
Showing search results (1–1 of 1):

  1. BioThesaurus (View Publication)
    Full Name of the Resource : A collection of gene/protein names and associated sequences
    Resource Category : Databases -> Genomic Databases (Non-Human) -> Genome Annotation Terms, Ontologies and Nomenclature

    Brief Description : BioThesaurus is a web-based system that maps a comprehensive collection of protein and gene names to protein entries in the UniProt Knowledgebase (UniProtKB). Currently covering more than two million protein sequences, BioThesaurus consists of over 2.8 million names extracted from multiple molecular biology databases according to the database cross-references provided in iProClass (Wu et al., 2004). The BioThesaurus web site supports two lookups: retrieving the synonymous names of a given protein entry, and identifying the protein entries that share a given name. The BioThesaurus dataset can be used for automatic protein named entity recognition. It is updated monthly and can be freely downloaded at
    Subject Area : BioThesaurus

    Institute/s :
    University of Maryland at Baltimore County, 1000 Hilltop Circle, Baltimore, MD 21250, USA
    Georgetown University Medical Center, 3900 Reservoir Road, NW, Washington, DC 20057, USA
    Country : United States

    Associated Institutes :

    • Department of Information Systems, University of Maryland at Baltimore County, 1000 Hilltop Circle, MD 21250, USA
    • Department of Biochemistry and Molecular Biology, Georgetown University Medical Center, Washington, DC, USA

    Associated Country : USA

    Authors/Contributors : Liu H.
    Contact Email :
    Year : 2005
    Language : English

    Keywords : Animals; Computational Biology / *methods; Databases, Factual; Databases, Genetic; Databases, Protein; Genome; Humans; Information Storage and Retrieval; Internet; Models, Genetic; Names; Proteins; Terminology as Topic; *Vocabulary, Controlled
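
    Illustration : The two lookups named in the Brief Description (synonymous names of a given entry, and entries sharing the same name) can be pictured as a pair of inverted indexes. The following is a minimal Python sketch under stated assumptions, not the BioThesaurus implementation or its download format: the records, the entry identifiers (ENTRY_A, ENTRY_B), the function names, and the case-insensitive name matching are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict

# Hypothetical toy records as (entry identifier, name) pairs.
# These are placeholders, not real UniProtKB entries or BioThesaurus data.
TOY_RECORDS = [
    ("ENTRY_A", "kinase alpha"),
    ("ENTRY_A", "KINA"),
    ("ENTRY_B", "KINA"),
    ("ENTRY_B", "kinase beta"),
]


def build_indexes(records):
    """Build entry -> names and normalized name -> entries indexes."""
    names_by_entry = defaultdict(set)
    entries_by_name = defaultdict(set)
    for entry_id, name in records:
        names_by_entry[entry_id].add(name)
        # Assumption: names are matched case-insensitively.
        entries_by_name[name.casefold()].add(entry_id)
    return names_by_entry, entries_by_name


def synonyms(names_by_entry, entry_id):
    """Return all names recorded for one entry, sorted."""
    return sorted(names_by_entry.get(entry_id, ()))


def entries_sharing_name(entries_by_name, name):
    """Return all entries recorded under the given name, sorted."""
    return sorted(entries_by_name.get(name.casefold(), ()))


names_by_entry, entries_by_name = build_indexes(TOY_RECORDS)
print(synonyms(names_by_entry, "ENTRY_A"))
print(entries_sharing_name(entries_by_name, "kina"))
```

    In this toy data the name "KINA" is attached to both entries, so the second lookup returns both identifiers. Ambiguous names of this kind are what a named entity recognition system built on such a dataset would need to resolve.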