  1. Legume Information System
    Full Name of the Resource : Legume information server (formerly Medicago genome initiative): ESTs, gene expression and proteomic data
    Brief Description : The Legume Information System (LIS), formerly the Medicago Genome Initiative (MGI), is an EST sequence database and analysis system that supports EST sequencing at the Noble Foundation Center for Medicago Genome Research. Medicago truncatula (also known as "barrel medic" because of the shape of its seed pods) is a forage and model legume that is a close relative of alfalfa and soybean. The pea family (Leguminosae) comprises more than 18,000 types of legumes, and these plants are second only to grasses in economic importance. MGI was first reported in the Nucleic Acids Research 2001 Database Issue (1), and featured a prototype database, interface and analysis pipeline. We have since developed an entirely new system that retains the advantages of the prototype, with improvements that make it more portable, modular, flexible, interactive and reusable (2). The data model is designed around the concept of an analysis operation (which may run a third-party sequence analysis tool) whose input and output are each a set of sequences (zero, one or many sequences). This permits analysis methods that use individual sequences (e.g. similarity search) or multiple sequences (e.g. EST clustering) to interact with the same generalized relational database structure. It also allows for the flexible addition of sequence analysis methods, and for the storage and analysis of genomic DNA sequences in the same schema. The analysis pipeline is run automatically upon receipt of new sequences and can be configured to perform any series of available operations. The current suite of operations includes: Import; Vector Screen; Quality Control; BLASTN search to identify non-mRNA contamination; clustering, multiple sequence alignment and extraction of a consensus; BLASTX versus a protein database; and Blocks+ (protein motif) search.
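The set-in, set-out data model described above can be illustrated with a minimal Python sketch. This is not the LIS/MGI implementation: the names `Sequence`, `quality_control`, `toy_cluster` and `run_pipeline` are hypothetical, and the clustering rule (grouping by a shared four-base prefix) is a toy stand-in for real EST clustering and consensus extraction. It only shows how single-sequence and multi-sequence operations can share one interface and be chained in a configurable order.

```python
from dataclasses import dataclass
from typing import Callable, Dict, FrozenSet, Iterable, List


@dataclass(frozen=True)
class Sequence:
    """Hypothetical minimal sequence record (not the LIS schema)."""
    seq_id: str
    residues: str


SequenceSet = FrozenSet[Sequence]
# Every operation takes a set of sequences and returns a set of sequences,
# which may hold zero, one or many members.
Operation = Callable[[SequenceSet], SequenceSet]


def quality_control(seqs: SequenceSet, min_length: int = 5) -> SequenceSet:
    """Per-sequence operation: drop reads shorter than min_length (toy rule)."""
    return frozenset(s for s in seqs if len(s.residues) >= min_length)


def toy_cluster(seqs: SequenceSet) -> SequenceSet:
    """Multi-sequence operation: group by 4-base prefix (toy stand-in for
    clustering) and emit the longest member of each group as its 'consensus'."""
    groups: Dict[str, List[Sequence]] = {}
    for s in seqs:
        groups.setdefault(s.residues[:4], []).append(s)
    consensus = set()
    for prefix, members in groups.items():
        longest = max(members, key=lambda s: (len(s.residues), s.seq_id))
        consensus.add(Sequence(f"consensus_{prefix}", longest.residues))
    return frozenset(consensus)


def run_pipeline(seqs: Iterable[Sequence], operations: List[Operation]) -> SequenceSet:
    """Apply the configured operations in order, feeding each output set
    into the next operation."""
    current: SequenceSet = frozenset(seqs)
    for op in operations:
        current = op(current)
    return current


ests = [
    Sequence("est1", "ACGTACGTAA"),
    Sequence("est2", "ACGTAC"),
    Sequence("est3", "TTGACCA"),
    Sequence("est4", "TT"),
]
result = run_pipeline(ests, [quality_control, toy_cluster])
result_ids = sorted(s.seq_id for s in result)
```

Because every step has the same signature, adding a new analysis method in this sketch means writing one more function and listing it in the `operations` argument.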
Annotation is automated by linking high-scoring BLAST and Blocks+ hits to their cognate entries in the Gene Ontology database. Users view, query and manipulate their data via a WWW browser through a completely redesigned interface running on a secure server. All analysis operations are performed on consensus sequences (gene sequences) resulting from the clustering and assembly operation, rather than on individual ESTs. MGI now incorporates all publicly available M. truncatula data from GenBank, combined with public Noble data, in clustering and analysis runs. The data are typically refreshed four times per year, including a complete reanalysis with all available new data. As of September 2001, MGI contained over 95,000 sequences, of which the 65,000 GenBank ESTs grouped into 8,843 clusters and 11,279 singletons, resulting in 20,122 total analyzed consensus sequences. Cluster membership ranged from two ESTs (3,585 clusters) to 256 ESTs (one cluster). A publicly viewable version of MGI has been deployed, which can be accessed by following the login instructions on the main page.
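The relationship between clusters, singletons and analyzed consensus sequences can be sketched as a simple tally. The function `summarize` and its input format (a mapping from EST identifier to cluster identifier) are assumptions made for illustration, not part of MGI; the final line only checks that the figures reported above are consistent with each other.

```python
from collections import Counter
from typing import Dict


def summarize(assignments: Dict[str, str]) -> Dict[str, int]:
    """Count clusters (two or more ESTs) and singletons (exactly one EST).

    `assignments` maps a hypothetical EST id to a hypothetical cluster id.
    Each cluster and each singleton yields one consensus sequence.
    """
    sizes = Counter(assignments.values())
    clusters = sum(1 for n in sizes.values() if n >= 2)
    singletons = sum(1 for n in sizes.values() if n == 1)
    return {
        "clusters": clusters,
        "singletons": singletons,
        "consensus_sequences": clusters + singletons,
    }


# Toy data: two ESTs share cluster c1, one EST stands alone in c2.
toy = summarize({"e1": "c1", "e2": "c1", "e3": "c2"})

# Consistency check on the reported September 2001 figures.
reported_total = 8843 + 11279
```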

    Plant Biology Division, The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73402, USA
    Country : United States

    • The National Center for Genome Resources, 2935 Rodeo Park Drive East, Santa Fe, NM 87505, USA
    • Plant Biology Division, The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73402, USA
    • Virginia Bioinformatics Institute, 1750 Kraft Drive Suite 1400, Virginia Tech, Blacksburg, VA 24061, USA

    Associated Country : USA

    Authors/Contributors : Gregory D. May
    Year : 2001
    Language : English