Advance Search              Latest Recources

Showing search results (1–1 of 1):

  1. eBLOCKS (View Publication)
    Full Name of the Resource : Highly conserved protein sequence blocks
    Resource Category : Databases -> Protein Sequence Databases -> Protein sequence motifs and active sites

    Brief Description : Classifying proteins into families and super-families allows identification of functionally mportant conserved domains. The motifs and scoring matrices derived from such conserved regions provide computational tools to recognize similar patterns in novel sequences, and thus enable protein function prediction for genomes. The eBLOCKs database enumerates a cascade of protein blocks with varied conservation levels for each functional domain. A biologically important region is most stringently conserved among a smaller family of highly similar proteins. The same region is often found in a larger group of more remotely related proteins with a reduced stringency. Through enumeration, highly specific signatures can be generated from blocks with more columns and fewer family members, while highly sensitive signatures can be derived from blocks with fewer columns and more members as in a super-family. By applying PSI-BLAST and a modified K-means clustering algorithm, eBLOCKs automatically groups protein sequences according to different levels of similarity. Multiple sequence alignments are made and trimmed into a series of ungapped blocks. Motifs and position specific scoring matrices were derived from eBLOCKs and made available for sequence search and annotation. The eBLOCKs database provides a tool for high throughput genome annotation with maximal specificity and sensitivity. The eBLOCKs database is freely available on the World Wide Web at to all users for on-line usage. Academic and not for profit institutions wishing copies of the program should contact Douglas L. Brutlag ( or Jacqueline Tay (;
    Subject Area : Protein Blocks, Enumeration

    Institute/s :
    Department of Biochemistry, Stanford University, Stanford, USA
    Address of Institute/s :
    Department of Biochemistry, Stanford University, Stanford, CA 94305-5307
    Country : United States

    Associated Institutes :

    • Abgenix, Inc., 6701 Kaiser Drive, MS 11, Fremont, CA 94555, USA
    • Oracle Corporation, 400 Oracle Parkway, Redwood shores, CA 94065, USA
    • Department of Biochemistry and 3Biomedical Informatics, Stanford University, Stanford, CA 94305, USA

    Associated Country : USA

    Authors/Contributors : Douglas L. Brutlag
    Contact Email :
    Year : 2004
    Language : English

    Keywords : Algorithms; Amino Acid Sequence; Conserved Sequence; Databases, Protein; Protein Structure, Tertiary; Proteins / classification; Sequence Alignment; Sequence Analysis, Protein; Software