Your browser version may not work well with NCBI's Web applications. More information here...
HomoloGene is a system for automated detection of homologs among the annotated genes of several completely sequenced eukaryotic genomes.
HomoloGene Release 62 Statistics



Initial numbers of genes from complete genomes, numbers of genes placed in a homology group, and the numbers of groups for each species.

Species   Number of Genes   HomoloGene
  Input Grouped   groups
Homo sapiens 22,849  19,964   19,351
Pan troglodytes 25,096  17,398   16,913
Canis lupus familiaris 19,766  16,732   16,294
Bos taurus 23,797  18,112   16,639
Mus musculus 25,388  21,538   19,410
Rattus norvegicus 21,991  19,092   17,865
Gallus gallus 17,959  12,988   12,279
Danio rerio 26,288* 17,789   15,288
Drosophila melanogaster 14,085  8,190   7,977
Anopheles gambiae 13,909  8,479   7,921
Caenorhabditis elegans 20,077  5,299   5,070
Schizosaccharomyces pombe 5,043  3,211   3,175
Saccharomyces cerevisiae 5,880  4,744   4,593
Kluyveromyces lactis 5,335  4,458   4,427
Eremothecium gossypii 4,722  3,949   3,940
Magnaporthe grisea 12,832  6,843   6,403
Neurospora crassa 10,079  6,128   6,122
Arabidopsis thaliana 26,981  13,374   13,041
Oryza sativa 26,887  12,973   12,603
Plasmodium falciparum 5,266  990   965


'*' indicates organisms where new genome annotation data is used in this build.


Last updated on: Mon Jul 28 2008



We have recently adopted a new build procedure that makes use of amino acid sequence searching (blastp) to find more distant relationships, but the procedure still refers to the DNA sequence for computation of some of the statistics. The matching strategy is guided by the taxonomic tree such that more closely related organisms are compared first. Moreover, HomoloGene entries now include paralogs in addition to orthologs.




Sources of Additional Information



HomoloGene entries have been augumented with homology and phenotype information drawn from the following sources.

Online Mendelian Inheritance in Man (OMIM)

Mouse Genome Informatics (MGI)

Zebrafish Information Network (ZFIN)

Saccharomyces Genome Database (SGD)

Clusters of Orthologous Groups (COG)

FlyBase

 

What's New
HomoloGene release 62 is now public. It incorporates updated annotation for Danio rerio Zv7 release (NCBI release 3.1, Jun. 12, 2008).


Tip of The Day




Related Resources


Entrez Genomes


A collection of complete genome sequences that includes more than 1000 viruses and over hundred microbes

  Archaea

  Bacteria

  Eukaryota

  Viruses



  COGs

Phylogenetic classification of proteins encoded in complete genomes.