Showing search results (1–2 of 2):

  1. Blocks (View Publication)
    Full Name of the Resource : Alignments of conserved regions in protein families
    Resource Category : Databases -> Protein Sequence Databases -> Protein Sequence Motifs and Active Sites

    Brief Description : Blocks are ungapped multiple alignments corresponding to the most conserved regions of proteins. The Blocks Database (1) consists of blocks constructed from documented families of related proteins by the automated PROTOMAT system (2). It currently contains 11,853 blocks representing 2,608 protein families documented in InterPro (3) and Prints (4). A blocks multiple alignment consists of ungapped conserved regions separated by unaligned regions of variable size. The PROTOMAT system applies a robust motif-finder (5) to a set of related protein sequences. Resulting candidate motifs are assembled into a best set along the lengths of the sequences, and additional sequences may be added if they are known to be related and can be aligned with all of the resulting blocks for a family. In order to reduce the redundancy and size of the Blocks Database, PROTOMAT is applied to families of sequences documented in InterPro in a hierarchical manner by perceived quality of documentation and lack of family inter-relationships. Since the Prints Database format is consistent with that of Blocks, Prints blocks are added directly without running PROTOMAT. The LAMA algorithm (6) is used to compare all blocks added to the Blocks Database with each other to further reduce redundant entries. The Blocks Database is used to annotate proteins of unknown function (16). Protein or DNA sequences can be compared with the Blocks Database using the BLIMPS searching tool (7), or the IMPALA and RPS-BLAST tools from NCBI (8, 9), all of which provide statistics to evaluate hits. When a search of the Blocks Database hits a protein family, the user is linked to the InterPro documentation page. The Blocks WWW Server also provides several tools that enhance the information found there. Visual displays of the blocks for a family are provided by sequence logos (10), by maps of the blocks along the sequences, and by highlighting the blocks on known structures of the sequences in them.
Structures can be viewed with the ProWeb PDB Viewer, or with a browser helper application. A phylogenetic tree is made from the blocks representing each family and can be explored with the ProWeb tree viewer. Reverse searches of the block alignments against sequence databases are facilitated by links to BLAST (9, 11), MAST (12) and LAMA (6) searching pages. Blocks are linked to the CODEHOP PCR primer design tool (14), which uses the multiple alignment to design hybrid consensus-degenerate primers. Blocks are also linked to SIFT (15), a program which predicts the effect of amino acid substitutions from multiple alignment information. Finally, links are provided from the Blocks Database to CYRCA sets of consistently aligned blocks (13). Each CYRCA set contains similar blocks identified from consistent LAMA alignments of pairs of blocks. Each set usually contains conserved regions of similar function and structure that appear in different contexts. For blocks that have members with known structures, CYRCA has a tool to superimpose the structures according to their alignment in these sets. All of these tools except CYRCA are also available when users make blocks from their own sequences or excise blocks from their own multiple alignments using the 'Block Maker' and 'Multiple Alignment Processor' features.
    Subject Area : Proteins


    Institute/s :
    Fred Hutchinson Cancer Research Center, Seattle, WA, USA
    Address of Institute/s :
    Fred Hutchinson Cancer Research Center, Seattle, WA, USA
    Country : United States

    Associated Institutes :

    • Fred Hutchinson Cancer Research Center, Seattle, WA, USA
    • Howard Hughes Medical Institute, USA
    • Weizmann Institute of Science, Rehovot, Israel

    Associated Country : USA; Israel


    Authors/Contributors : Jorja G Henikoff; Elizabeth A Greene; Shmuel Pietrokovski; Steven Henikoff
    Contact Email : jorja@fhcrc.org
    Year : 1999
    Language : English

    Keywords : Amino Acid Sequence; Databases, Factual; Information Storage and Retrieval; Internet; Molecular Sequence Data; Proteins / chemistry; Sequence Homology, Amino Acid
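
The Brief Description of this record characterizes a block as an ungapped multiple alignment that a query sequence can be compared against. As a rough illustration only, the Python sketch below builds per-column residue frequencies from a toy block, converts them to log-odds scores, and slides the resulting matrix along a query. The block contents, the uniform background frequency, the pseudocount, and all function names are assumptions made for this example; it is not the scoring scheme used by BLIMPS, IMPALA, or RPS-BLAST.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Hypothetical toy block: equal-length, gap-free segments from related proteins.
TOY_BLOCK = ["GDSGGP", "GDSGGP", "GDAGGP", "GNSGGA"]


def column_frequencies(block, pseudocount=1.0):
    """Return one {residue: frequency} dict per column of an ungapped block."""
    width = len(block[0])
    if any(len(seg) != width or "-" in seg for seg in block):
        raise ValueError("a block must be ungapped and of uniform width")
    total = len(block) + pseudocount * len(AMINO_ACIDS)
    freqs = []
    for col in range(width):
        counts = Counter(seg[col] for seg in block)
        freqs.append(
            {aa: (counts.get(aa, 0) + pseudocount) / total for aa in AMINO_ACIDS}
        )
    return freqs


def log_odds_matrix(freqs, background=1.0 / 20.0):
    """Convert column frequencies to log2 odds against a uniform background."""
    return [{aa: math.log2(p / background) for aa, p in col.items()} for col in freqs]


def best_window(sequence, matrix):
    """Slide the matrix along the sequence; return (start, score) of the best window."""
    width = len(matrix)
    best = None
    for start in range(len(sequence) - width + 1):
        # Residues outside the 20 standard amino acids contribute a neutral 0.0.
        score = sum(
            matrix[i].get(sequence[start + i], 0.0) for i in range(width)
        )
        if best is None or score > best[1]:
            best = (start, score)
    return best


if __name__ == "__main__":
    matrix = log_odds_matrix(column_frequencies(TOY_BLOCK))
    print(best_window("MKTAGDSGGPLV", matrix))
```

Because a block has no gaps, every column can be scored independently, which is what makes the simple sliding-window comparison possible. Real search tools additionally weight sequences and report statistics for evaluating hits, which this sketch omits.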


  2. Blocks (View Publication)
    Full Name of the Resource : Alignments of conserved regions in protein families
    Resource Category : Databases -> Protein Sequence Databases -> Protein Sequence Motifs and Active Sites

    Brief Description : Blocks are ungapped multiple alignments corresponding to the most conserved regions of proteins. The Blocks Database consists of blocks constructed from documented families of related proteins by the automated PROTOMAT system. It currently contains 11,853 blocks representing 2,608 protein families documented in InterPro and Prints. A blocks multiple alignment consists of ungapped conserved regions separated by unaligned regions of variable size. The PROTOMAT system applies a robust motif-finder to a set of related protein sequences. Resulting candidate motifs are assembled into a best set along the lengths of the sequences, and additional sequences may be added if they are known to be related and can be aligned with all of the resulting blocks for a family. In order to reduce the redundancy and size of the Blocks Database, PROTOMAT is applied to families of sequences documented in InterPro in a hierarchical manner by perceived quality of documentation and lack of family inter-relationships. Since the Prints Database format is consistent with that of Blocks, Prints blocks are added directly without running PROTOMAT. The LAMA algorithm is used to compare all blocks added to the Blocks Database with each other to further reduce redundant entries. The Blocks Database is used to annotate proteins of unknown function. Protein or DNA sequences can be compared with the Blocks Database using the BLIMPS searching tool, or the IMPALA and RPS-BLAST tools from NCBI, all of which provide statistics to evaluate hits. When a search of the Blocks Database hits a protein family, the user is linked to the InterPro documentation page. The Blocks WWW Server also provides several tools that enhance the information found there. Visual displays of the blocks for a family are provided by sequence logos, by maps of the blocks along the sequences, and by highlighting the blocks on known structures of the sequences in them.
Structures can be viewed with the ProWeb PDB Viewer, or with a browser helper application. A phylogenetic tree is made from the blocks representing each family and can be explored with the ProWeb tree viewer. Reverse searches of the block alignments against sequence databases are facilitated by links to BLAST, MAST and LAMA searching pages. Blocks are linked to the CODEHOP PCR primer design tool, which uses the multiple alignment to design hybrid consensus-degenerate primers. Blocks are also linked to SIFT, a program that predicts the effect of amino acid substitutions from multiple alignment information. Finally, links are provided from the Blocks Database to CYRCA sets of consistently aligned blocks. Each CYRCA set contains similar blocks identified from consistent LAMA alignments of pairs of blocks. Each set usually contains conserved regions of similar function and structure that appear in different contexts. For blocks that have members with known structures, CYRCA has a tool to superimpose the structures according to their alignment in these sets. All of these tools except CYRCA are also available when users make blocks from their own sequences or excise blocks from their own multiple alignments using the 'Block Maker' and 'Multiple Alignment Processor' features.
    Subject Area : Conserved Region of Protein


    Institute/s :
    Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, USA
    Address of Institute/s :
    Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA 98109-1024, USA
    Country : United States

    Associated Institutes :

    • Fred Hutchinson Cancer Research Center, Seattle, WA, USA
    • Howard Hughes Medical Institute
    • Weizmann Institute of Science, Rehovot, Israel

    Associated Country : USA; Israel


    Authors/Contributors : Jorja G. Henikoff
    Contact Email : jorja@fhcrc.org
    Year : 1999
    Language : English

    Keywords : Amino Acid Sequence; Databases, Factual; Information Storage and Retrieval; Internet; Molecular Sequence Data; Proteins / chemistry; Sequence Homology, Amino Acid
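
The Brief Description of this record mentions that blocks are compared with each other to reduce redundant entries. The Python sketch below illustrates the general idea of comparing two ungapped blocks column by column: it turns each block into a residue-frequency profile, tries every ungapped offset between the two profiles, and reports the offset with the highest mean Pearson correlation over the overlapping columns. This is a simplified illustration under stated assumptions, not the LAMA implementation; the toy blocks, the minimum overlap, and all function names are hypothetical.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def profile(block):
    """Return, per column, a 20-element list of residue frequencies."""
    width = len(block[0])
    columns = []
    for i in range(width):
        counts = Counter(seg[i] for seg in block)
        columns.append([counts.get(aa, 0) / len(block) for aa in AMINO_ACIDS])
    return columns


def pearson(x, y):
    """Pearson correlation of two equal-length vectors (0.0 if either is constant)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) * (a - mean_x) for a in x)
    var_y = sum((b - mean_y) * (b - mean_y) for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def best_ungapped_overlap(block_a, block_b, min_overlap=3):
    """Return (offset, mean_r): column i of block_a pairs with column i - offset of block_b."""
    prof_a, prof_b = profile(block_a), profile(block_b)
    best = None
    for offset in range(-(len(prof_b) - min_overlap), len(prof_a) - min_overlap + 1):
        pairs = [
            (i, i - offset)
            for i in range(len(prof_a))
            if 0 <= i - offset < len(prof_b)
        ]
        if len(pairs) < min_overlap:
            continue
        mean_r = sum(pearson(prof_a[i], prof_b[j]) for i, j in pairs) / len(pairs)
        if best is None or mean_r > best[1]:
            best = (offset, mean_r)
    return best


if __name__ == "__main__":
    # Hypothetical toy blocks that share a conserved core shifted by one column.
    block_a = ["GDSGGP", "GDSGGP", "GDAGGP"]
    block_b = ["DSGGPL", "DSGGPI", "DAGGPL"]
    print(best_ungapped_overlap(block_a, block_b))
```

A high mean correlation at some offset suggests that the two blocks describe the same conserved region, which is the kind of signal a redundancy filter could act on. The threshold for calling two blocks redundant is left open here because the record does not state one.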