Current versions of orthologous databases and the source of individual sequence names

NCBI KOG

(March, 2003 release)

Species                      Sequence Name Source
Drosophila melanogaster      GenBank, GI number
Homo sapiens                 GenBank, "Hs"+GI number
Arabidopsis thaliana         GenBank, locus tag
Caenorhabditis elegans       GenBank, locus tag
Saccharomyces cerevisiae     GenBank, locus tag
Schizosaccharomyces pombe    GenBank, locus tag
Encephalitozoon cuniculi     GenBank, locus tag
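
The naming rules in the table above can be sketched in code. The following is a minimal illustration only, assuming that a human name is the literal prefix "Hs" immediately followed by digits and that a Drosophila name is a bare run of digits; the function names and the example identifiers are hypothetical and do not come from the KOG release itself.

```python
import re

def classify_kog_name(name):
    """Guess which KOG naming scheme a sequence name follows (assumed formats)."""
    if re.fullmatch(r"Hs\d+", name):
        return "human_gi"   # "Hs" + GI number (Homo sapiens)
    if re.fullmatch(r"\d+", name):
        return "gi"         # bare GI number (Drosophila melanogaster)
    return "locus_tag"      # anything else is treated as a locus tag

def extract_gi(name):
    """Return the GI number as an int, or None if the name is not GI-based."""
    match = re.fullmatch(r"(?:Hs)?(\d+)", name)
    return int(match.group(1)) if match else None

# Hypothetical identifiers, used only to exercise the functions.
for example in ("Hs4501885", "7290000", "YAL001C"):
    print(example, classify_kog_name(example), extract_gi(example))
```

Treating every non-numeric name as a locus tag is a simplification; a stricter check would need the locus tag pattern of each species.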

NCBI COG

(March, 2003 release)

Species                      Sequence Name Source
All species                  GenBank, locus tag

MultiParanoid

(July, 2006 release)

Species                      Sequence Name Source
Homo sapiens                 EBI curated non-redundant SWISS-PROT+TREMBL+ENSEMBL version, Integr8
Ciona intestinalis           JGI predicted transcripts, version 1.0
Drosophila melanogaster      Flybase, version 3.1
Caenorhabditis elegans       WormPep, version 114

OrthoMCL DB

(version 1.0, January, 2006 release)

*All sequences are renamed in OrthoMCL DB using a 3-letter abbreviation followed by a number (see OrthoMCL DB).
Species    Sequence Name Source
Aquifex aeolicus VF5 GenBank, ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Aquifex_aeolicus/
Thermotoga maritima MSB8 GenBank, ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Thermotoga_maritima/
Dehalococcoides ethenogenes 195 GenBank, ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Dehalococcoides_ethenogenes_195
Deinococcus radiodurans R1 GenBank, ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Deinococcus_radiodurans/
Treponema pallidum subsp. pallidum str. Nichols GenBank, ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Treponema_pallidum/
Chlorobium tepidum TLS GenBank, ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Chlorobium_tepidum_TLS/
Rhodopirellula baltica SH_1 GenBank, ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Pirellula_sp/
Chlamydophila pneumoniae CWL029 GenBank, ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Chlamydophila_pneumoniae_CWL029/
Synechococcus sp. WH8102 GenBank, ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Synechococcus_sp_WH8102/
Mycobacterium tuberculosis H37Rv GenBank, ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Mycobacterium_tuberculosis_H37Rv/
Bacillus anthracis Ames GenBank, ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Bacillus_anthracis_Ames/
Wolinella succinogenes DSM 1740 GenBank, ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Wolinella_succinogenes/
Geobacter sulfurreducens PCA GenBank, ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Geobacter_sulfurreducens/
Agrobacterium tumefaciens C58 UWash GenBank, ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Agrobacterium_tumefaciens_C58_UWash/
Ralstonia solanacearum GMI1000 GenBank, ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Ralstonia_solanacearum/
Escherichia coli K12 GenBank, ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Escherichia_coli_K12/
Halobacterium sp. NRC-1 GenBank, ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Halobacterium_sp/
Methanococcus jannaschii DSM 2661 GenBank, ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Methanococcus_jannaschii/
Sulfolobus solfataricus P2 GenBank, ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Sulfolobus_solfataricus/
Nanoarchaeum equitans Kin4-M GenBank, ftp://ftp.ncbi.nih.gov/genomes/Bacteria/Nanoarchaeum_equitans/
Entamoeba histolytica TIGR, ftp://ftp.tigr.org/pub/data/Eukaryotic_Projects/e_histolytica/annotation_dbs/EHA1.pep
Dictyostelium discoideum dictyBase, http://www.dictybase.org/db/cgi-bin/dictyBase/download/download.pl?area=blast_databases&ID=dicty_primary_protein.gz
Plasmodium falciparum 3D7 PlasmoDB, v4.4, http://plasmodb.org/restricted/data/P_falciparum/WG/cds.aa/Pfa3D7_WholeGenome_Annotated_PEP_2005.2.11.fasta
Plasmodium yoelii 17XNL PlasmoDB, v4.4, http://www.plasmodb.org/restricted/data/P_yoelii/Whole_genome/cds.aa/Pyoelii_WholeGenome_Annotated_PEP_2002.9.10.fasta
Plasmodium knowlesi PlasmoDB, v4.3, http://www.plasmodb.org/restricted/data/P_knowlesi/WG/cds.aa/pkn_prots.db
Cryptosporidium parvum Iowa (Type2) CryptoDB, http://cryptodb.org/download/public/pep/genomic/CpIOWA/CpIOWA_protein.gz
Cryptosporidium hominis TU502 (Type 1) CryptoDB, http://cryptodb.org/download/public/pep/genomic/ChTU502/ChTU502_protein.gz
Toxoplasma gondii ToxoDB, N/A
Theileria parva TIGR, ftp://ftp.tigr.org/pub/data/Eukaryotic_Projects/t_parva/annotation_dbs/
Saccharomyces cerevisiae S288C SGD, ftp://genome-ftp.stanford.edu/pub/yeast/sequence/genomic_sequence/orf_protein/orf_trans_all.fasta.gz
Schizosaccharomyces pombe Sanger, ftp://ftp.sanger.ac.uk/pub/yeast/pombe/Protein_data/pompep
Yarrowia lipolytica CLIB99 Genolevures, http://cbi.labri.u-bordeaux.fr/Genolevures/raw/seq/Y_lipolytica.rc2.aa
Kluyveromyces lactis CLIB210 Genolevures, http://cbi.labri.u-bordeaux.fr/Genolevures/raw/seq/K_lactis.rc2.aa
Debaryomyces hansenii CBS767 Genolevures, http://cbi.labri.u-bordeaux.fr/Genolevures/raw/seq/D_hansenii.rc2.aa
Candida glabrata CBS138 Genolevures, http://cbi.labri.u-bordeaux.fr/Genolevures/raw/seq/C_glabrata.rc2.aa
Encephalitozoon cuniculi GenBank, ftp://ftp.ncbi.nih.gov/genomes/Encephalitozoon_cuniculi/*.faa
Cryptococcus neoformans TIGR, ftp://ftp.tigr.org/pub/data/Eukaryotic_Projects/c_neoformans/annotation_dbs/CNA1.pep
Ashbya gossypii AGD, http://agd.unibas.ch/Ashbya_gossypii/downloads/AGD_ORF_translations_r2_1.fas
Neurospora crassa OR74A WI, ftp://www-genome.wi.mit.edu/pub/annotation/N_crassa/release7/neurospora_crassa_7_proteins.fasta.gz
Cyanidioschyzon merolae 10D UTOKYO, http://merolae.biol.s.u-tokyo.ac.jp/download/cds.fasta.txt
Thalassiosira pseudonana JGI, ftp://ftp.jgi-psf.org/pub/JGI_data/Diatom/thaps1/thaps1Prots.fasta.gz
Arabidopsis thaliana TIGR, ftp://ftp.tigr.org/pub/data/a_thaliana/ath1/SEQUENCES/ATH1.pep.gz
Oryza sativa TIGR, ftp://ftp.tigr.org/pub/data/Eukaryotic_Projects/o_sativa/annotation_dbs/BAC_PAC_clones/OSA1.pep
Caenorhabditis elegans WORMBASE, ftp://ftp.wormbase.org/pub/wormbase/data_freezes/WS140/acedb/wormpep140.tar.gz
Caenorhabditis briggsae SANGER, ftp://ftp.sanger.ac.uk/pub/wormbase/cbriggsae/cb25.agp8/brigpep2.pep.gz
Drosophila melanogaster Ensembl, ftp://ftp.ensembl.org/pub/fly-30.3d/data/fasta/pep/Drosophila_melanogaster.BDGP3.2.1.apr.pep.fa.gz
Anopheles gambiae PEST Ensembl, ftp://ftp.ensembl.org/pub/anopheles-30.2e/data/fasta/pep/Anopheles_gambiae.MOZ2a.apr.pep.fa.gz
Ciona intestinalis JGI, ftp://ftp.jgi-psf.org/pub/JGI_data/Ciona/v1.0/ciona.prot.fasta.gz
Fugu rubripes Ensembl, ftp://ftp.ensembl.org/pub/fugu-30.2e/data/fasta/pep/Fugu_rubripes.FUGU2.apr.pep.fa.gz
Tetraodon nigroviridis Ensembl, ftp://ftp.ensembl.org/pub/tetraodon-30.1b/data/fasta/pep/Tetraodon_nigroviridis.TETRAODON7.apr.pep.fa.gz
Danio rerio Ensembl, ftp://ftp.ensembl.org/pub/zebrafish-30.4c/data/fasta/pep/Danio_rerio.ZFISH4.apr.pep.fa.gz
Homo sapiens Ensembl, ftp://ftp.ensembl.org/pub/human-30.35c/data/fasta/pep/Homo_sapiens.NCBI35.apr.pep.fa.gz
Mus musculus Ensembl, ftp://ftp.ensembl.org/pub/mouse-30.33f/data/fasta/pep/Mus_musculus.NCBIM33.apr.pep.fa.gz
Rattus norvegicus Ensembl, ftp://ftp.ensembl.org/pub/rat-30.34/data/fasta/pep/Rattus_norvegicus.RGSC3.4.apr.pep.fa.gz
Gallus gallus Ensembl, ftp://ftp.ensembl.org/pub/chicken-30.1f/data/fasta/pep/Gallus_gallus.WASHUC1.apr.pep.fa.gz
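
The renaming note for OrthoMCL DB (a 3-letter abbreviation followed by a number) can be illustrated with a small parser. This is only a sketch under stated assumptions: it assumes the abbreviation and the number are written together with no separator, and both the function name and the example identifier are hypothetical rather than taken from OrthoMCL DB.

```python
import re

# Assumed layout: exactly three letters, then one or more digits, no separator.
_ORTHOMCL_ID = re.compile(r"([A-Za-z]{3})(\d+)")

def split_orthomcl_id(seq_id):
    """Split an identifier into (abbreviation, number) under the assumed layout."""
    match = _ORTHOMCL_ID.fullmatch(seq_id)
    if match is None:
        raise ValueError(f"not a 3-letter abbreviation plus number: {seq_id!r}")
    return match.group(1).lower(), int(match.group(2))

# Hypothetical identifier, used only to exercise the function.
abbreviation, number = split_orthomcl_id("hsa12345")
print(abbreviation, number)
```

If the real identifiers use a separator between the abbreviation and the number, the pattern would need to be adjusted accordingly.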

TIGR EGO

(release 13, July, 2007)

Species                      Sequence Name Source
All species                  TIGR Gene Index Project, Tentative Consensus (TC) number