Bioinformatics World    
         
 Main Index -> OTHER GENOMES
                -> Multi-Species Portals
                -> Single Species
               
                      
Navigate    AtoZ   Search this Site   Site Journal    FAQ Index   Main Index   Appendix       
          
Multi-Species Portals
Linkpage 1 -  
Genome Channel
(ORNL)
Genome Channel, developed by the Oak Ridge National Laboratory (ORNL) provides a vast overview of sequencing projects in many organisms (especially bacterial). From eukaryotes, only human, mouse and yeast available. BLAST and keyword search possible. 
Linkpage 2 -  
GOLD - Genomes OnLine Database
(Lawrence Berkeley Laboratory and others)
The Genomes OnLine Database (GOLD) is a World Wide Web resource for comprehensive access to information regarding complete and ongoing genome projects around the world. GOLD provides the largest available and most detailed monitoring of genome sequencing projects.
Ensembl
(EBI + Sanger)

including:

TraceView

Ensembl is a joint project between EMBL-EBI and the Sanger Centre to develop a software system which produces and maintains automatic annotation on eukaryotic genomes

1. Ensembl Genomes: Starting at the homepage, there is access to a variety of organism specific genomes like Human,  Mouse, Drosophila, Zebrafish, and Mosquito.
In principle, similiar remarks are valid also here as stated for the Ensembl human genome, refer to this section.

2. TraceView: the Ensembl trace repository provides a permanent archive for single pass DNA sequencing reads and associated traces and quality values. You may search the whole archive or subsets using SSAHA, a new fast equivalent of BLAST, or download sets of read sequences in fasta format and associated quality values by FTP.
HGSC -
Human Genome Sequencing Center
(Baylor College of Medicine)
The Human Genome Sequencing Center (HGSC) at Baylor College of Medicine is involved in many genomic and cDNA sequencing projects.  The "Overview" page provides access to all these projects, including species like Human, Mouse, Rat, Drosophila, Rhesus monkey (Macaca mulatta), Chimpanzee, and more.
NCBI - multispecies genome resources
(NCBI)

including:
Genomic Biology

Entrez Genome

Entrez Genome Project

Trace Archive


1. Genomic Biology: A good site to start with when looking for genomic data from various organisms like Human, Mouse, Rat, Zebrafish, Drosophila, Plants, Microbial Genomes, Malaria, and Retroviruses. The links lead to access of NCBI-based software and tools and to other non-NCBI genome databases.

2. Entrez Genome provides a complete list of all eukaryote sequencing efforts for which genomic data is publicly available. This includes large-scale, small-scale genome sequencing projects, as well as EST, GSS, or cDNA sequencing projects.

2.1. Entrez Genome -  Microbial genomes is a linkpage to numerous microbial genome projects.
2.2. Entrez Genome -  Plant Genomes Central: Collection of plant genome sequencing projects.

3. Entrez Genome Project: The NCBI Entrez Genome Project database is intended to be a searchable collection of complete and incomplete (in-progress) large-scale sequencing, assembly, annotation, and mapping projects for cellular organisms. The database is organized into organism-specific overviews that function as portals from which all projects in the database pertaining to that organism can be browsed and retrieved. For example, you will find all information regarding the human genome project on this project page. Note: Although the Entrez Genome Project database does not include viral genome sequencing projects, data from those projects are submitted to GenBank and are available in the Entrez Nucleotide and Entrez Genome databases.
 
4. The Trace Archive has been developed to store the raw genomic data underlying all of the sequence generated by genome projects. This archive is exchanging data regularly with the Ensembl Trace Server. You can BLAST these sequences with MegaBLAST and Cross species MegaBLAST. The latter version is designed specifically to identify diverged sequences with a low degree of identity, where the standard MegaBLAST is not effective.    
TIGR genome resources
(TIGR, Rockville, MD, USA)

including:
TIGR Genome Projects

TIGR CMR - Comprehensive Microbial Resource

TIGR UFMG - Unfinished Microbial Genomes

TIGR Eukaryotic Projects

TIGR Parasites Databases

TIGR Fungal Databases
1. TIGR Genome Projects: are a collection of curated databases containing DNA and protein sequence, gene expression, cellular role, protein family, and taxonomic data for microbes, plants and humans. Anonymous FTP access to sequence data is also provided.

2.
TIGR CMR (Comprehensive Microbial Resource): CMR  is a tool that allows the researcher to access all of the bacterial genome sequences completed to date. For each genome not sequenced at TIGR two kinds of annotation are displayed: the Primary annotation taken from the genome sequencing center and the TIGR annotation generated by an automated annotation process at TIGR.

3.
TIGR Unfinished Microbial Genomes (UFMG): lists of genomes currently being sequenced and also lists of completed microbial genomes. You may also run a BLAST search on the unfinished genomes.

4.
TIGR Eukaryotic Projects: lists of eukaryotic genome sequencing projects at TIGR.

5.
TIGR Parasites Databases: Links to TIGR sequencing projects completed and underway as well as links to related world-wide sequencing efforts of a series of parasite species.

6.
TIGR Fungal Databases: Fungal genomes that are in the process of being sequenced.

7. TIGR Gene Indices: Integrating data from international EST sequencing, genome sequencing and gene research projects, Gene index are an analysis of transcribed sequences represented in the world's public EST data. 
Please refer to this main description of the TIGR Gene Indices ! 
UCSC
(University of California, Santa Cruz)

including:
BLAT

UCSC Genome Browser Gateway
1. The UCSC BLAT program allows a very fast access to the growing number of genome databases maintained at UCSC. BLAT uses query sequences to quickly find sequences of 95% and greater similarity of length 40 bases or more. The available species include Human, Mouse, Rat, Drosophila, C.elegans, Fugu, SARS, and more.
Please refer to this main chapter describing BLAT.

2. UCSC Genome Browser Gateway (multi-species):
Here, you can browse all the listed species by gene names, keywords, accession numbers, cytogenetic positions, author names (submitters), words like "zinc finger" or "huntington", and many more. In principle, it is the same site and interface as for the human genome, so please refer to the main section of the UCSC Genome Browser for a detailed description.


Single Species
Bos taurus (Cow) -
genome resources
1. Bovine Genome Project (BCM): The human genome sequencing center at Baylor College of Medicine is working to sequence and annotate the genome of the cow. You can BLAST these genomic data (only BLASTN supported).

2. BLAST Bos taurus UniGene (EST clusters) at Embnet.ch. 
C.elegans (Worm) -
genome resources

including:

WormBase
(consortium)

Proteome BioKnowledge Library
(Incyte)
1. WormBase is an international consortium of biologists and computer scientists dedicated to providing the research community with accurate, current, accessible information concerning the genetics, genomics and biology of C. elegans and some related nematodes. Founded in 2000, the WormBase Consortium of CalTech, Wellcome Trust Sanger Institute (WTSC), Cold Spring Harbor (CSHL), and the Washington University Genome Sequencing Center.
Note: WormBase can be queried using the BioMart data mining tool. Please refer also to the section WormBase MartView.

2.
The so-called Proteome BioKnowledge Library contains extensive yeast and worm genome (proteome) information; (YPD stands for yeast protein database; WormPD stands for C.elegans protein database); 
NOTE: These databases are now part of Incyte Genomics, access needs (free) registration: www.incyte.com/control/tools/proteome  Login
 
3.  You may also BLAST against the C. elegans genome starting from the "eukaryotic genomes selection page"at NCBI and choosing the C. elegans genome. 
     
4. You may also BLAST against the C. elegans genome at Embnet.ch.

5. You can also perform BLAT on the C.elegans genome at UCSC !
Danio rerio (Zebrafish) - genome resources 1. The NCBI Zebrafish Genome Resources page brings together information on diverse zebrafish-related resources from multiple centers.
 
2. You can BLAST the Zebrafish  genome at NCBI. You can choose between different databases (mRNA, EST, HTGS, Reference proteins...). 

3.
You can access the zebrafish genome at the Ensembl Site (BLAST, keyword search, chromosome browser, etc.)
 
4. BLAST Zebrafish UniGene (EST clusters) at Embnet.ch.
Drosophila melanogaster (Fruitfly) -
genome resources

including:
FlyBase
(consortium)
1. FlyBase is a comprehensive database for information on the genetics and molecular biology of Drosophila. It includes data from the Drosophila Genome Projects and data curated from the literature.

2. The drosophila genome is also available from the Ensembl website !

3. NCBI
provides a genome browser and a BLAST option for the drosophila genome.
 
4. You may also BLAST the Drosophila genome starting from the  "eukaryotic genomes selection page" at NCBI and choosing the Drosophila genome.

5. You may selectively BLAST Drosophila ESTs at Embnet.ch.

6. You can also perform BLAT on the Fruitfly genome at UCSC !
Macaca mulatta (Rhesus monkey) -
genome resources
1. At HGSC, you will find an overview of the Rhesus monkey genome project, with several links.

2. Rhesus monkey genomic sequences are blastable at the NCBI Trace archive, using  Cross-species Mega BLAST or "normal" Mega BLAST.

3. It is now (May 2005) possible to perform BLAT search on the Rhesus monkey genome. Mmul_0.1 (UCSC version rheMac1) is a preliminary assembly.

4. The Rhesus monkey genome is also available from the Ensembl website !
Mus musculus (Mouse) - genome resources

including:
MGI - Mouse Genome Informatics
(Jackson Laboratory, Maine, USA) 

MGD - Mouse Genome Database
(Jackson Laboratory, Maine, USA)

GXD - Gene Expression Database
(Jackson Laboratory, Maine, USA)

MGS - Mouse Genome Sequence Project
(Jackson Laboratory, Maine, USA)


1. MGI (Mouse Genome Informatics) is maintained at Jackson Laboratory, Maine, USA and collects all data about mouse genes, nomenclature, map positions, individual ESTs etc. 
The main projects contributing to this resource are listed on this MGI Overview page:
1.1. MGD(Mouse Genome Database) includes data on gene characterization, nomenclature, mapping, gene homologies among mammals, sequence links, phenotypes, allelic variants and mutants, and strain data. (See About MGD.)
1.2. GXD (Gene Expression Database) integrates different types of gene expression information from the mouse and provides a searchable index of published experiments on endogenous gene expression during development. The GXD-Gene Expression page allows to query mouse expression data via different options.
1.3. MGS (Mouse Genome Sequence Project): The goal of the MGS project is to integrate emerging mouse genomic sequence data with the genetic and biological data available in MGD and GXD. MGS supports the MouseBLAST server as a sequence-level entry point into the MGI Database.

Typical MGI accessions: refer to section MGI IDs.
 
2. The mouse genome is also available from the Ensembl website !

3.
A good site for data integration is the Mouse genome resource page at NCBI.

4.
You can also BLAST the mouse genome at NCBI !
 
5. You can also perform BLAT on the mouse genome at UCSC ! In the dropdown menu "Freeze" choose the desired data freeze !
NOTE: If the BLAT links are unstable, launch the BLAT search starting from the UCSC homepage by choosing the organism and the link "Blat". 
 
6. You may also try to SSAHA search the mouse genome. SSAHA is a software tool for very fast matching and alignment of DNA sequences.
Please refer to this main description of SSAHA.

7. BLAST Mouse UniGene (EST clusters) at Embnet.ch.
Pan troglodytes (Chimpanzee) -
genome resources
1.  The NCBI Taxonomy Browser provides good insight not only into the evolutionary lineage but collects many links concerning the chimp sequencing projects.

2.
The chimp genome is also available from the Ensembl website.

3. It is also possible to BLAT search the chimpanzee genome at UCSC.
Rattus (Rat) -
genome resources

including:

RGD - Rat Genome Database
(Wisconsin Medical College)

1. The rat genome resources page of NCBI brings together information on diverse rat-related resources from multiple centers: sequence, mapping and clone information. 

2. You can BLAST the rat genome at NCBI.
 
3. The Rat genome database (RGD) is maintained at Wisconsin Medical College and is the primary repository for rat genomic data. The major new resource in v2.0 is VCMap (Virtual Comparative Map) and the comparative maps between Rat, Mouse and Human that are based on the VCMap algorithm.

4.
The University of Iowa collects data of the Rat EST Project; corresponds to the TIGR Clusters, but NO consensus sequence; some sequences are only available here.

5.
The TIGR Rat Gene Index collects Rat EST and corresponding genomic data. 

6. BLAST Rat UniGene (EST clusters) at Embnet.ch.

7. It is also possible to BLAT search the rat genome at UCSC.
Saccharomyces cerevisiae
(Bakers Yeast) -
genome resources

including:
CYGD - Comprehensive Yeast Genome Database
(MIPS, Munich, Germany)
1. The CYGD - Comprehensive Yeast Genome Database, maintained at MIPS, Munich, Germany, presents a comprehensive database which summarizes the current knowledge regarding the more than 6000 ORFs encoded by the Yeast Genome. 

2. You can BLAST the Yeast Genome at NCBI, starting from the "BLAST eukaryotic genomes" - BLAST pages.     

3. You can BLAST the yeast genome at Embnet.ch.

4. You can BLAT search the yeast genome at UCSC.

Sus scrofa (Pig) - genome resources
1. The pig genome resources page at NCBI provides a good starting point for information regarding the pig genome data. This homepage brings together information on porcine-related resources from NCBI and the pig research community.

2. The Entrez Genome Project page specific for the Pig genome provides direct links to e.g. all pig-derived sequences stored, like EST, mRNA, protein, and genomic.

3. The TIGR Pig Gene Index provides access to pig EST clusters including the "Tentative Consensus" (TC) sequences.

4. The pig ESTs are also clustered in UniGene (without attempt to produce contig sequences).

NOTE (June 2005): There is NO Ensembl database and NO BLAT search option for the pig genome, yet.
Xenopus 
(Clawed Frog) -
genome resources
1. The Sanger Center provides a BLAST server which allows a restricted search in the Xenopus tropicalis or Xenopus laevis EST database prepared from the EMBL database.
   
2. NCBI UniGene provides clusters of Xenopus laevis ESTs. 
 
3. You can BLAST Xenopus UniGene at Embnet.ch !
     
4. You can BLAT search the genome of Xenopus tropicalis.