Main logo
?
tutorial arrow
×
Submit new tools
Share tools covering the current topic. Provide easy-to-follow guidelines to improve their usability.
Share new tools with the community
Sign up for free to promote the availability of bioinformatics tools

Genomic databases | Genome annotation

Genome annotation information is available from many sources including publications on the sequencing and annotation of genes for whole genomes, individual chromosomes, and whole-genome annotation computed by multiple bioinformatics groups. Ensembl…
FlyBase
Dataset

FlyBase

It is the leading website and database of Drosophila genes and genomes. FlyBase…

It is the leading website and database of Drosophila genes and genomes. FlyBase curates a variety of data from published biological literature, including phenotype, gene expression, interactions…

GenBank
Dataset

GenBank

A comprehensive database that contains publicly available nucleotide sequences…

A comprehensive database that contains publicly available nucleotide sequences for over 300 000 formally described species. These sequences are obtained primarily through submissions from individual…

NCBI
Dataset

NCBI National Center for Biotechnology Information

Supplies several online resources for biological information. NCBI is a…

Supplies several online resources for biological information. NCBI is a web-based platform gathering information, tools, and functions that can be useful for researchers about biology. It offers user…

Ensembl
Dataset

Ensembl

A genomic interpretation system providing the most up-to-date annotations,…

A genomic interpretation system providing the most up-to-date annotations, querying tools and access methods for chordates and key model organisms. The REST server, which allows programs written in…

EcoCyc
Dataset

EcoCyc

A scientific database for the bacterium Escherichia coli K-12 MG1655. The…

A scientific database for the bacterium Escherichia coli K-12 MG1655. The EcoCyc project performs literature-based curation of the entire genome, and of transcriptional regulation, transporters, and…

G T A T C G C T A
Highlander
Desktop

Highlander

A Java software coupled to a local database to centralize all variant data and…

A Java software coupled to a local database to centralize all variant data and annotations from the lab, and to provide powerful filtering tools that are easily accessible to the biologist. Data can…

EuMicrobedbLite
Dataset

EuMicrobedbLite

A light weight comprehensive genome resource and sequence analysis platform for…

A light weight comprehensive genome resource and sequence analysis platform for oomycete organisms. EuMicrobedbLite is a successor of the VBI Microbial Database (VMD) that was built using the Genome…

Genome Project…
Dataset

Genome Project of Streptomyces avermitilis

Collects information from Streptomyces avermitilis. Genome Project of…

Collects information from Streptomyces avermitilis. Genome Project of Streptomyces avermitilis contains genome sequence, protein-coding sequence, stable RNA sequence and an annotation table. Beside,…

GCGene
Dataset

GCGene Gastric Cancer Gene database

A literature-based database with comprehensive annotations supported by a…

A literature-based database with comprehensive annotations supported by a user-friendly website. In the current release, we have collected 1,815 unique human genes including 1,678 protein-coding and…

IGC
Dataset

IGC integrated gene catalog

Represents a comprehensive resource for further investigations of the gut…

Represents a comprehensive resource for further investigations of the gut microbiome, covering strains with a diverse range of occurrence frequencies. IGC allows rapid and multi-omic profiling of the…

EBI
Dataset

EBI EMBL-EBI - The European Bioinformatics Institute

Supplies an access to several biological data resources. EBI is a database that…

Supplies an access to several biological data resources. EBI is a database that covers the entire range of biological sciences: raw DNA sequences to curated proteins, chemicals, structures, systems,…

IRGSP
Dataset

IRGSP International Rice Genome Sequencing Project

Provides a complete finished quality sequence of the rice genome (Oryza sativa…

Provides a complete finished quality sequence of the rice genome (Oryza sativa L. ssp. japonica cv. Nipponbare). IRGSP contains high-quality map-based draft sequence.

CGD
Dataset

CGD Candida Genome Database

Provides gene, protein and sequence information for multiple Candida species.…

Provides gene, protein and sequence information for multiple Candida species. CGD contains web-based tools for accessing, analyzing and exploring these data, to facilitate and accelerate research…

modENCODE
Dataset

modENCODE

Provides the biological research community with a comprehensive encyclopedia of…

Provides the biological research community with a comprehensive encyclopedia of genomic functional elements in the model organisms C. elegans and D. melanogaster. modENCODE is run as a Research…

Global Ocean…
Dataset

Global Ocean Microbiome

Provides global biodiversity resources for larger organismal size spectra.…

Provides global biodiversity resources for larger organismal size spectra. Global Ocean Microbiome is a gene catalogue and analysis of ocean microbes in their environmental context across three depth…

CoReCG
Dataset

CoReCG Colon Rectal Cancer Gene Database

Contains 2056 colon-rectal cancer genes information involved in distinct…

Contains 2056 colon-rectal cancer genes information involved in distinct colorectal cancer stages sourced from published literature with an effective knowledge based information retrieval system.…

UCSC Genome Browser
Web

UCSC Genome Browser University of California Santa Cruz Genome Browser

Displays the assembled human genome and other mammalian genomes. UCSC Genome…

Displays the assembled human genome and other mammalian genomes. UCSC Genome Browser provides browsers for more than 180 assemblies and over 100 species. It provides a collection of tools to explore…

MSGene
Dataset

MSGene Metastasis Suppressor Gene Database

The first literature-based gene resource for exploring human metastasis…

The first literature-based gene resource for exploring human metastasis suppressor genes (MS genes) to unveil the cellular complexity of MS genes. MSGene database stores 194 human MS genes (161…

dbEMT
Dataset

dbEMT

A literature-based gene resource for exploring epithelial-mesenchymal…

A literature-based gene resource for exploring epithelial-mesenchymal transition (EMT)-related human genes. dbEMT includes literature data, clinical relevant variants, gene expression profiles and…

RefSeq
Dataset

RefSeq Reference Sequence

Maintains and curates a publicly available database of annotated genomic,…

Maintains and curates a publicly available database of annotated genomic, transcript, and protein sequence records. The RefSeq project leverages the data submitted to the International Nucleotide…

Gene
Dataset

Gene

Gathers gene-specific information from several sources such as Gene Ontology…

Gathers gene-specific information from several sources such as Gene Ontology (GO). Gene compiles genomes that are completely represented by whole genome shotgun (WGS) assemblies with a unique GeneID.…

Heliagene
Dataset

Heliagene

Provides visualization, querying tools for data mining and network exploration…

Provides visualization, querying tools for data mining and network exploration around Helianthus annuus. Heliagene offers a reference genome and companion resources to accelerate breeding programs.…

Pseudomonas…
Dataset

Pseudomonas Genome Database

Collaborates with an international panel of expert Pseudomonas researchers to…

Collaborates with an international panel of expert Pseudomonas researchers to provide high quality updates to the PAO1 genome annotation and make cutting edge genome analysis data available. The…

BioGPS
Dataset

BioGPS

A centralized gene-annotation portal that enables researchers to access…

A centralized gene-annotation portal that enables researchers to access distributed gene annotation resources. The unique features of BioGPS, compared to those of other gene portals, are its…

SGD
Dataset

SGD Saccharomyces Genome Database

Compiles comprehensive integrated biological information about the budding…

Compiles comprehensive integrated biological information about the budding yeast Saccharomyces cerevisiae. SGD is a manually-curated database which aims to improve the discovery of functional…

Chromosome 7…
Dataset

Chromosome 7 Annotation Project

Compiles information about chromosome 7. Chromosome 7 Annotation Project…

Compiles information about chromosome 7. Chromosome 7 Annotation Project contains a collation of sequence, gene, and other annotations from databases, such as Celera published, NCBI, Ensembl, RIKEN…

CyanoBase
Dataset

CyanoBase

Provides an easy way of accessing the sequences and all-inclusive annotation…

Provides an easy way of accessing the sequences and all-inclusive annotation data on the structures of the cyanobacterial genomes. It contains cyanobacterial genomic sequences from 376 species, which…

Gramene
Dataset

Gramene

Offers comparative functional genomics in crops and model plant species.…

Offers comparative functional genomics in crops and model plant species. Gramene uses information generated from projects supported by public funds to improve the study of cross-species comparisons.…

DDBJ
Dataset

DDBJ DNA Data Bank of Japan

Maintains and provides public archival, retrieval and analytical services for…

Maintains and provides public archival, retrieval and analytical services for biological information. The contents of the DDBJ databases are shared with the US National Center for Biotechnology…

PATRIC
Dataset

PATRIC Pathosystems Resource Integration Center

Aims to assist scientists in infectious-disease research. PATRIC is a National…

Aims to assist scientists in infectious-disease research. PATRIC is a National Institute of Health (NIH) supported bioinformatics resource center that has been built to enable comparative genomic…

EuPathDB
Dataset

EuPathDB

Allows users to search eukaryotic pathogens. EuPathDB includes visualization…

Allows users to search eukaryotic pathogens. EuPathDB includes visualization and analysis tools to better understand data mining. It assists users to discover meaningful biological relationships. It…

Branchiostoma…
Dataset

Branchiostoma floridae

Offers gene annotation of Branchiostoma floridae, a lancelet of the genus…

Offers gene annotation of Branchiostoma floridae, a lancelet of the genus Branchiostoma. The genome of this species reveals that among the chordates, the morphologically simpler tunicates are…

The Apple…
Dataset

The Apple Genome and Epigenome

Gives access to the apple genome. The Apple Genome and Epigenome contains up to…

Gives access to the apple genome. The Apple Genome and Epigenome contains up to date gene annotation, transposable element (TE) annotation, genetic markers, DNA methylation data, small RNA data and…

Thermus…
Dataset

Thermus thermophilus

Presents the whole sequenced genome of Thermus thermophilus. T. thermophilus is…

Presents the whole sequenced genome of Thermus thermophilus. T. thermophilus is an extremely thermophilic bacterium, which grows optimally between 65 and 72 C. The type strain T. thermophilus HB8 was…

Flavobacterium…
Dataset

Flavobacterium psychrophilum

Presents the whole sequenced genome of Flavobacterium psychrophilum.…

Presents the whole sequenced genome of Flavobacterium psychrophilum. Flavobacterium psychrophilum is the causative agent of cold water disease in salmon and trout, which is responsible for…

Desulfovibrio…
Dataset

Desulfovibrio vulgaris

Presents the complete genome sequence of the Desulfovibrio vulgaris, a member…

Presents the complete genome sequence of the Desulfovibrio vulgaris, a member of sulfate-reducing bacteria (SRB) commonly found in a variety of soil and aquatic environments. The D. vulgaris…

Ricinus…
Dataset

Ricinus communis

A database which offers gene annotation of Ricinus communis, also known as…

A database which offers gene annotation of Ricinus communis, also known as Castorbean. The genome sequence assembly was searched for repetitive DNA using a combination of sequence alignment to…

VectorBase
Dataset

VectorBase

A National Institute of Allergy and Infectious Diseases supported…

A National Institute of Allergy and Infectious Diseases supported Bioinformatics Resource Center (BRC) for invertebrate vectors of human pathogens. VectorBase currently hosts the genomes of 35…

MaizeGDB
Dataset

MaizeGDB Maize Genetics and Genomics Database

Provides several types of information about corn. MaizeGDB is an online…

Provides several types of information about corn. MaizeGDB is an online repository offering several functions: genome browser, or bin viewer. It also proposes different tools allowing users to work…

Salmonella…
Dataset

Salmonella Typhi

Presents the whole sequenced genome of Salmonella enterica serovar Typhi (S.…

Presents the whole sequenced genome of Salmonella enterica serovar Typhi (S. typhi). S. Typhi is the aetiological agent of typhoid fever, a serious invasive bacterial disease of humans with an annual…

Pleurobrachia…
Dataset

Pleurobrachia bachei

Offers assembly and gene annotation of Pleurobrachia bachei, which is in the…

Offers assembly and gene annotation of Pleurobrachia bachei, which is in the Pleurobrachiidae family. The database sequences the Pleurobrachia bachei genome and identifies ~19,600 gene models, 96% of…

Thalassiosira…
Dataset

Thalassiosira pseudonana

A database offering assembly and gene annotation of the Thalassiosira…

A database offering assembly and gene annotation of the Thalassiosira pseudonana, a species of marine centric diatom. It is a model for diatom physiology studies, belongs to a genus widely…

PGSB PlantsDB
Dataset

PGSB PlantsDB Plant Genome and Systems Biology PlantsDB

A database framework for the comparative analysis and visualization of plant…

A database framework for the comparative analysis and visualization of plant genome data. PGSB PlantsDB has been updated with new data sets and types as well as specialized tools and interfaces to…

IMG
Dataset

IMG Integrated Microbial Genomes

Offers a collection of genomes from all three domains of life, as well as…

Offers a collection of genomes from all three domains of life, as well as viruses, plasmids and genome fragments. IMG contains biosynthetic clusters of genes associated with pathways involved in the…

CCSB-Broad…
Dataset

CCSB-Broad Lentiviral Expression Library

Allows users to search information about genome-scale expression. CCSB-Broad…

Allows users to search information about genome-scale expression. CCSB-Broad Lentiviral Expression Library is a database that provides more than 15 000 human open-reading frames (ORFs). It enables…

hORFeome…
Dataset

hORFeome Database

Provides several informations about single-colony, fully-sequenced cloned…

Provides several informations about single-colony, fully-sequenced cloned human. hORFeome Database consists in genome-scale human ORFeome collections. Users can search: (i) open-reading frames…

Nematostella…
Dataset

Nematostella vectensis

Offers gene annotation of Nematostella vectensis also known as starlet sea…

Offers gene annotation of Nematostella vectensis also known as starlet sea anemone. This genome includes approximately 7.8X whole genome sequencing (WGS) in small insert end-sequence coverage. After…

Avianbase
Dataset

Avianbase

A resource for bird genomics, which provides access to data released by the…

A resource for bird genomics, which provides access to data released by the Avian Phylogenomics Consortium. This bird portal can be tailored to the needs of the individual bird research communities.…

CCDS
Dataset

CCDS Consensus Coding Sequence

A collaborative effort to maintain a dataset of protein-coding regions that are…

A collaborative effort to maintain a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assemblies by the National Center for Biotechnology…

VIPERdb
Dataset

VIPERdb Virus Particle Explorer database

A database for icosahedral virus capsid structures. VIPERdb provides a…

A database for icosahedral virus capsid structures. VIPERdb provides a comprehensive resource specific to the needs of the structural virology community, with an emphasis on the description and…

Manihot…
Dataset

Manihot esculenta

Offers a gene annotation of Manihot esculenta also known as cassava. The…

Offers a gene annotation of Manihot esculenta also known as cassava. The Manihot esculenta database is an Illumina-based assembly from the same genotype, AM560-2. The scaffolds have been anchored…

IMG/VR
Dataset

IMG/VR

Visualizes and analyses viral sequences. IMG/VR allows users to explore…

Visualizes and analyses viral sequences. IMG/VR allows users to explore associated metadata to decipher biogeographical and habitat distribution patterns of viral species as well as traveling across…

TAIR
Dataset

TAIR The Arabidopsis Information Resource

Maintains a database of genetic and molecular biology data for the model higher…

Maintains a database of genetic and molecular biology data for the model higher plant Arabidopsis thaliana. Data available from TAIR includes the complete genome sequence along with gene structure,…

HGD
Dataset

HGD Hymenoptera Genome Database

Gathers information about genomic data of hymenopteran insects. Hymenoptera…

Gathers information about genomic data of hymenopteran insects. Hymenoptera Genome Database aims to provide a way for users to perform comparative studies and to investigate processes that are common…

Xanthomonas…
Dataset

Xanthomonas campestris

Presents the whole sequenced genome of Xanthomonas campestris. X. campestris is…

Presents the whole sequenced genome of Xanthomonas campestris. X. campestris is a major cause of black rot in crucifers, a disease that results in massive tissue degeneration. The type strain was…

Thermotoga…
Dataset

Thermotoga maritima

Presents the whole sequenced genome of Thermotoga maritima. T. maritima is a…

Presents the whole sequenced genome of Thermotoga maritima. T. maritima is a hyperthermophile with an optimum growth temperature of 80°C. The type strain T. maritima MSB8 was isolated from…

Rickettsia…
Dataset

Rickettsia prowazekii

Presents the whole sequenced genome of the obligate intracellular parasite…

Presents the whole sequenced genome of the obligate intracellular parasite Rickettsia prowazekii, the causative agent of epidemic typhus. This genome contains 834 protein-coding genes. The functional…

Mycobacterium…
Dataset

Mycobacterium leprae

Contains the entire genome of Mycobacterium leprae TN. Mycobacterium leprae has…

Contains the entire genome of Mycobacterium leprae TN. Mycobacterium leprae has the longest doubling time of all known bacteria and has thwarted every effort at culture in the laboratory. It is the…

Leptospira…
Dataset

Leptospira interrogans

Reports the complete genomic sequence of a representative virulent serovar type…

Reports the complete genomic sequence of a representative virulent serovar type strain of Leptospira interrogans serogroup Icterohaemorrhagiae consisting of a 4.33-megabase large chromosome and a…

Borreliella…
Dataset

Borreliella burgdorferi

Contains a linear chromosome of 910,725 base pairs and at least 17 linear and…

Contains a linear chromosome of 910,725 base pairs and at least 17 linear and circular plasmids with a combined size of more than 533,000 base pairs. Borrelia burgdorferi is the causative agent of…

Bacillus cereus
Dataset

Bacillus cereus

Reports the sequencing and analysis of the type strain Bacillus cereus ATCC…

Reports the sequencing and analysis of the type strain Bacillus cereus ATCC 14579. Bacillus cereus is an opportunistic pathogen causing food poisoning manifested by diarrheal or emetic syndromes. It…

Aquifex…
Dataset

Aquifex aeolicus

Contains the complete genome sequence of 1,551,335 base pairs of the…

Contains the complete genome sequence of 1,551,335 base pairs of the evolutionarily and physiologically interesting organism. Aquifex aeolicus was one of the earliest diverging, and is one of the…

Giant Panda…
Dataset

Giant Panda Database

Presents the entire panda genome sequence, as well as the annotation…

Presents the entire panda genome sequence, as well as the annotation information such as gene structure and functions, non-coding RNAs and repeat elements. The Giant Panda Database is illustrated in…

Gasterosteus…
Dataset

Gasterosteus aculeatus

Offers gene annotation of Gasterosteus aculeatus also known as three-spined…

Offers gene annotation of Gasterosteus aculeatus also known as three-spined stickleback. The assembly has been sequenced by whole-genome shotgun sequencing with a base coverage of approximately 11x.…

Latimeria…
Dataset

Latimeria chalumnae

Offers gene annotation of Latimeria chalumnae also known as West Indian Ocean…

Offers gene annotation of Latimeria chalumnae also known as West Indian Ocean coelacanth. Models built from Coelacanth proteins and cDNAs have been given priority over predictions from other…

Oryzias latipes
Dataset

Oryzias latipes

Offers gene annotation of Oryzias latipes also known as the medaka. Oryzias…

Offers gene annotation of Oryzias latipes also known as the medaka. Oryzias latipes is a model organism and is extensively used in many areas of biological research, most notably in toxicology. This…

Sus scrofa
Dataset

Sus scrofa

Offers gene annotation of Sus scrofa also known as pig. The haploid genome of…

Offers gene annotation of Sus scrofa also known as pig. The haploid genome of the domesticated pig is estimated to be 2800 Mb. The diploid genome is organized in 18 pairs of autosomes and two sex…

Ailuropoda…
Dataset

Ailuropoda melanoleuca

Offers gene annotation of Ailuropoda melanoleuca also known as giant panda. The…

Offers gene annotation of Ailuropoda melanoleuca also known as giant panda. The giant panda genome was sequenced using Illumina dye sequencing. Its genome contains 20 pairs of autosomes and one…

Canis lupus…
Dataset

Canis lupus familiaris

Offers gene annotation of Canis lupus familiaris also known as domestic dog. It…

Offers gene annotation of Canis lupus familiaris also known as domestic dog. It consists of 39 chromosomes (1-38 and X) and 15 unplaced scaffolds. Approximately 31.5 million sequence reads were…

Pan paniscus
Dataset

Pan paniscus

Offers gene annotation of Pan paniscus also known as bonobo. The bonobo genome…

Offers gene annotation of Pan paniscus also known as bonobo. The bonobo genome shows that more than 3% of the human genome is more closely related to either bonobos or chimpanzees than these are to…

Pan troglodytes
Dataset

Pan troglodytes

Offers gene annotation of Pan troglodytes also known as common chimpanzee. The…

Offers gene annotation of Pan troglodytes also known as common chimpanzee. The chimpanzee is an important model to study biology, disease, and evolution. Research with Pan troglodytes has provided…

Gorilla gorilla
Dataset

Gorilla gorilla

Offers gene annotation of Gorilla gorilla also known as Western lowland…

Offers gene annotation of Gorilla gorilla also known as Western lowland gorilla. Sequencing was undertaken using two separate methods: traditional capillary whole-genome shotgun (WGS) sequencing and…

Hydra…
Dataset

Hydra magnipapillata

Offers assembly and gene annotation of Hydra magnipapillata, which is in the…

Offers assembly and gene annotation of Hydra magnipapillata, which is in the family Hydridae. The Hydra genome has been shaped by bursts of transposable element expansion, horizontal gene transfer,…

Trichoplax…
Dataset

Trichoplax adhaerens

A database which offers assembly and gene annotation of Trichoplax adhaerens,…

A database which offers assembly and gene annotation of Trichoplax adhaerens, which is in the Metazoa family. The database reports the sequencing and analysis of the 98 million base pair nuclear…

Amphimedon…
Dataset

Amphimedon queenslandica

Offers assembly and gene annotation of Amphimedon queenslandica, which is in…

Offers assembly and gene annotation of Amphimedon queenslandica, which is in the Niphatidae family. Amphimedon queenslandica is remarkably similar to other animal genomes in content, structure and…

Yersinia pestis
Dataset

Yersinia pestis

Offers assembly and gene annotation of Yersinia pestis, which is in the…

Offers assembly and gene annotation of Yersinia pestis, which is in the Enterobacteriaceae family. Yersiniae consist of 11 species that have been traditionally distinguished by DNA-DNA hybridisation…

Ascaris suum
Dataset

Ascaris suum

Offers assembly and gene annotation of Ascaris suum also known as large…

Offers assembly and gene annotation of Ascaris suum also known as large roundworm of pigs, which is in the Ascarididae family. The database reports the 273 megabase draft genome of Ascaris suum and…

Triticum Urartu
Dataset

Triticum Urartu

A database which offers gene annotation of triticum urartu is the diploid…

A database which offers gene annotation of triticum urartu is the diploid progenitor of the bread wheat A-genome. Also known as red wild einkorn, is a diploid species whose genome is the A genome of…

Anolis…
Dataset

Anolis carolinensis

A database which offers gene annotation of Anolis carolinensis also known as…

A database which offers gene annotation of Anolis carolinensis also known as Carolina anole an arboreal lizard. The anole lizard genome is composed of 13 chromosomes, assembled from 41.9861 contigs…

Tetraodon…
Dataset

Tetraodon nigroviridis

A database which offers gene annotation of Tetraodon nigroviridis also known as…

A database which offers gene annotation of Tetraodon nigroviridis also known as green spotted puffer. The Tetraodon genome was sequenced using the whole-genome shotgun (WGS) approach. The Tetraodon…

Carica papaya
Dataset

Carica papaya

A database which offers gene annotation of Carica papaya. The papaya genome is…

A database which offers gene annotation of Carica papaya. The papaya genome is three times the size of the Arabidopsis genome, but contains fewer genes, including significantly fewer disease…

Aegilops…
Dataset

Aegilops tauschii

A database which offers assembly and gene annotation of Aegilops tauschii, also…

A database which offers assembly and gene annotation of Aegilops tauschii, also known as Tausch's goatgrass. The diploid progenitor of the bread wheat D-genome provides important evolutionary…

Guillardia…
Dataset

Guillardia theta

A database offering assembly and gene annotation of Guillardia theta, a…

A database offering assembly and gene annotation of Guillardia theta, a cryptomonad alga. It is an example of a cell-within-a-cell, being composed of a flagellate host cell, complete with…

Bacteroides…
Dataset

Bacteroides thetaiotaomicron

Performs whole genome transcriptional profiling of E. rectale and Bacteroides…

Performs whole genome transcriptional profiling of E. rectale and Bacteroides thetaiotaomicron after colonization of gnotobiotic mice with each organism alone, or in combination under 3 dietary…

Agrobacterium…
Dataset

Agrobacterium fabrum

Consists of a circular chromosome, a linear chromosome, and two plasmids.…

Consists of a circular chromosome, a linear chromosome, and two plasmids. Agrobacterium fabrum facilitates investigations into the molecular basis of pathogenesis and the evolutionary divergence of…

GeneMap
Dataset

GeneMap

Unifies the existing genetic and physical maps with the nucleotide and protein…

Unifies the existing genetic and physical maps with the nucleotide and protein sequence databases in a fashion that should speed the discovery of genes underlying inherited human disease. The GeneMap…

Haliaeetus…
Dataset

Haliaeetus albicilla

Offers gene annotation of Haliaeetus albicilla also known as the white-tailed…

Offers gene annotation of Haliaeetus albicilla also known as the white-tailed eagle. DNA was collected from a vouchered sample (137926) of Natural History Museum of Denmark from a male caught in…

RhesusBase
Dataset

RhesusBase

Gathers informations about genome-wide macaque gene. RhesusBase includes more…

Gathers informations about genome-wide macaque gene. RhesusBase includes more than 170 million annotation records from about 1,760 next-generation sequencing (NGS) data sets. Searches can be made by…

Phytozome
Dataset

Phytozome

Provides a centralized hub for plant genome and gene family data and analysis.…

Provides a centralized hub for plant genome and gene family data and analysis. Phytozome is a comparative hub which provides a view of the evolutionary history of every plant gene at the level of…

TriTrypDB
Dataset

TriTrypDB

Provides information about Trypanosomatidae. Tritrypdb is a collective database…

Provides information about Trypanosomatidae. Tritrypdb is a collective database which intends to gather annotation, curation and access to tools enabling sophisticated queries against genomic scale…

Enterococcus…
Dataset

Enterococcus faecalis

Presents the whole sequenced genome of Enterococcus faecalis. E. faecalis is an…

Presents the whole sequenced genome of Enterococcus faecalis. E. faecalis is an important nosocomial pathogen. The type strain E. faecalis V583 is a vancomycin-resistant clinical isolate. The genome…

Legionella…
Dataset

Legionella pneumophila

Presents the genomic sequence of Legionella pneumophila, the bacterial agent of…

Presents the genomic sequence of Legionella pneumophila, the bacterial agent of Legionnaires’ disease, a potentially fatal pneumonia acquired from aerosolized contaminated fresh water. Legionella…

Deinococcus…
Dataset

Deinococcus radiodurans

Presents the complete genome sequence of the radiation resistant bacterium…

Presents the complete genome sequence of the radiation resistant bacterium Deinococcus radiodurans R1. D. radiodurans represents an organism in which all systems for DNA repair, DNA damage export,…

Ciona…
Dataset

Ciona intestinalis

Offers gene annotation of Ciona intestinalis also known as vase tunicate. This…

Offers gene annotation of Ciona intestinalis also known as vase tunicate. This genome is the smallest of any experimentally accessible chordate, and thus provides a good system for exploring…

Equus ferus…
Dataset

Equus ferus caballus

Offers gene annotation of Equus ferus caballus also known as horse. It is a…

Offers gene annotation of Equus ferus caballus also known as horse. It is a model organism for research on biomechanics and exercise physiology. The genome sequence will facilitate the identification…

Homo…
Dataset

Homo neanderthalensis

Offers gene annotation of Homo neanderthalensis. According to preliminary…

Offers gene annotation of Homo neanderthalensis. According to preliminary sequences, 99.7% of the base pairs of the modern human and Neanderthal genomes are identical, compared to humans sharing…

Macaca mulatta
Dataset

Macaca mulatta

A database which offers gene annotation of Macaca mulatta also known as rhesus…

A database which offers gene annotation of Macaca mulatta also known as rhesus macaque. Because they are genetically and physiologically similar to humans, rhesus monkeys are the most widely used…

Mnemiopsis…
Dataset

Mnemiopsis leidyi

Offers assembly and gene annotation of Mnemiopsis leidyi, which is in the…

Offers assembly and gene annotation of Mnemiopsis leidyi, which is in the Bolinopsidae family. The Mnemiopsis Genome Project Portal is intended as a resource for investigators from a number of…

CGD
Dataset

CGD Cucurbit Genomics Database

A database which offers gene annotation of cucurbit. This base offers the…

A database which offers gene annotation of cucurbit. This base offers the genome of Melon (Cucumis melo), Cucumber (Cucumis sativus), Watermelon (Citrullus lanatus), Pumpkin (Cucurbita maxima). The…

NCBI Influenza…
Dataset

NCBI Influenza Virus Resource

Allows user to study and visualize large phylogenetic trees. NCBI Influenza…

Allows user to study and visualize large phylogenetic trees. NCBI Influenza Virus Resource provides public access to influenza sequence data and a convenient interface. This platform is useful for…

DictyBase
Dataset

DictyBase

Provides information about the model organism of the social amoeba…

Provides information about the model organism of the social amoeba Dictyostelium discoideum. Dictybase contains the complete genome sequence and expression data for the organism. It offers an…

HipSci
Dataset

HipSci human induced pluripotent Stem cells initiative

Generates human induced pluripotent stem cells (iPSCs) from hundreds of healthy…

Generates human induced pluripotent stem cells (iPSCs) from hundreds of healthy individuals as well as patients diagnosed with selected diseases. HipSci is a powerful resource to evaluate and…

Viral Genomes
Dataset

Viral Genomes

A reference resource designed to bring order to this sequence shockwave and…

A reference resource designed to bring order to this sequence shockwave and improve usability of viral sequence data. The resource catalogs all publicly available virus genome sequences and curates…

Francisella…
Dataset

Francisella tularensis

Presents the whole sequenced genome of Francisella tularensis. F. tularensis…

Presents the whole sequenced genome of Francisella tularensis. F. tularensis subsp. tularensis strain SCHU S4 was first isolated from a tularemia patient in the US in 1995. Because of its high…

Chlamydia…
Dataset

Chlamydia pneumoniae

Presents the complete sequence of Chlamydia pneumoniae. C. pneumoniae is a…

Presents the complete sequence of Chlamydia pneumoniae. C. pneumoniae is a newly recognized species of Chlamydia that is a natural pathogen of humans, and causes pneumonia and bronchitis. Comparison…

Bordetella…
Dataset

Bordetella pertussis

Includes the genome of Bordetella parapertussis 12822 (4,773,551 bp; 4,404…

Includes the genome of Bordetella parapertussis 12822 (4,773,551 bp; 4,404 genes). Bordetella pertussis is related to Gram-negative β-proteobacteria that colonize the respiratory tracts of mammals.…

Xiphophorus…
Dataset

Xiphophorus maculatus

Offers gene annotation of Xiphophorus maculatus also known as southern…

Offers gene annotation of Xiphophorus maculatus also known as southern platyfish. The genus Xiphophorus is composed of 27 described species of both platyfish and swordtails. The genome assembly…

Petromyzon…
Dataset

Petromyzon marinus

Offers gene annotation of Petromyzon marinus also known as the sea lamprey. The…

Offers gene annotation of Petromyzon marinus also known as the sea lamprey. The lamprey genome may serve as a model for developmental biology as well as evolution studies involving transposition of…

Anas…
Dataset

Anas platyrhynchos

Offers gene annotation of Anas platyrhynchos also known as the wild duck. The…

Offers gene annotation of Anas platyrhynchos also known as the wild duck. The assembly comprises 78487 top level sequences, all of which are unplaced scaffolds (from 227448 contigs). The N50 of the…

Virus…
Dataset

Virus Variation Resource

Provides viral sequence data hosted by the National Center for Biotechnology…

Provides viral sequence data hosted by the National Center for Biotechnology Information. The Virus Variation Resource includes modules for seven viral groups: influenza virus, Dengue virus, West…

PULDB
Dataset

PULDB Polysaccharide-Utilization Loci DataBase

Compiles information about polysaccharide utilization locus. PULDB provides a…

Compiles information about polysaccharide utilization locus. PULDB provides a repository, part of the CAZy database, for over 3900 PUL predictions in more than 70 species. The database provides the…

Vega
Dataset

Vega Vertebrate Genome Annotation

Provides annotation manually curated of human, mouse and zebrafish genomic…

Provides annotation manually curated of human, mouse and zebrafish genomic sequences. Vega was created by merging two in-house databases at the Sanger Institute: the pipeline database containing the…

IRD
Dataset

IRD Influenza Research Database

A comprehensive, integrated database and analysis resource for influenza…

A comprehensive, integrated database and analysis resource for influenza sequence, surveillance, and research data. IRD comprises (i) a comprehensive collection of influenza virus related data…

UTRome.org
Dataset

UTRome.org

Gathers information for 3'UTR biology. UTRome.org is a resource that…

Gathers information for 3'UTR biology. UTRome.org is a resource that stores tissue-specific mRNA concerning Caenorhabditis elegans. It supplies details about structures and alternative…

ParameciumDB
Dataset

ParameciumDB

Provides data about the model organism Paramecium tetraurelia. ParameciumDB was…

Provides data about the model organism Paramecium tetraurelia. ParameciumDB was created by using components of the Generic Model Organism Database (GMOD). It offers data about gene expression data…

TADB
Dataset

TADB

Supplies information about experimentally validated type II toxin-antitoxin…

Supplies information about experimentally validated type II toxin-antitoxin (TA) pairs and the data derived from computationally predicted datasets. TADB contains over 100 pairs of experimentally…

TrichDB
Dataset

TrichDB

Genome databases for Trichomonas vaginalis. Genomic-scale data available via…

Genome databases for Trichomonas vaginalis. Genomic-scale data available via TrichDB may be queried based on BLAST searches, annotation keywords and gene ID searches, GO terms, sequence motifs and…

Gene Wiki
Dataset

Gene Wiki

Gathers Wikipedia articles about human genes. Gene Wiki aims to constitute a…

Gathers Wikipedia articles about human genes. Gene Wiki aims to constitute a continuously updated, community-reviewed and collaboratively written review article for every human gene. The database…

YGOB
Dataset

YGOB Yeast Gene Order Browser

Facilitate visual comparisons and computational analysis of synteny…

Facilitate visual comparisons and computational analysis of synteny relationships in yeasts.

SoyKB
Dataset

SoyKB Soybean Knowledge Base

A comprehensive web resource developed for bridging soybean translational…

A comprehensive web resource developed for bridging soybean translational genomics and molecular breeding research. It provides information for six entities including genes/proteins, microRNAs/sRNAs,…

PaVE
Dataset

PaVE PapillomaVirus Episteme

Provides researchers with corrected and uniformly annotated reference genomes.…

Provides researchers with corrected and uniformly annotated reference genomes. PaVE assists the study of papillomavirus biology and aids in the development of therapeutics and diagnostics. It hosts…

CryptoDB
Dataset

CryptoDB

Compiles information about Cryptosporidium. Cryptodb intends to collect whole…

Compiles information about Cryptosporidium. Cryptodb intends to collect whole genome sequence, annotation, sequence analysis and related data about this parasite. The database integrates a set of…

TBDB
Dataset

TBDB TuBerculosis DataBase

An online database providing integrated access to genome sequence, expression…

An online database providing integrated access to genome sequence, expression data and literature curation for tuberculosis (TB). TBDB currently houses genome assemblies for numerous strains of…

GreenPhylDB
Dataset

GreenPhylDB

Provides a database for comparative genomic analysis full genomes. GreenPhylDB…

Provides a database for comparative genomic analysis full genomes. GreenPhylDB is a web accessible, user-friendly comparative platform for plant genomes studies including family classification,…

ZFIN
Dataset

ZFIN The Zebrafish Information Network

Provides genetic and genomic data involving zebrafish. ZFIN is composed of…

Provides genetic and genomic data involving zebrafish. ZFIN is composed of mutants, gene expression, phenotypes, knockdown reagents, antibodies, transgenic constructs, and reporter lines. It offers…

RGD
Dataset

RGD Rat Genome Database

Provides a comprehensive data repository and informatics platform related to…

Provides a comprehensive data repository and informatics platform related to the laboratory rat, one of the most important model organisms for disease studies. Rat Genome Database (RGD) maintains and…

TGD Wiki
Dataset

TGD Wiki Tetrahymena genome database Wiki

Gathers information about Tetrahymena thermophila genome sequence. TGD Wiki…

Gathers information about Tetrahymena thermophila genome sequence. TGD Wiki provides a curation interface that allows users to update information about each gene: gene names, descriptions, Gene…

proGenomes
Dataset

proGenomes

Provides user-friendly access to currently 25 038 high-quality genomes whose…

Provides user-friendly access to currently 25 038 high-quality genomes whose sequences and consistent annotations can be retrieved individually or by taxonomic clade. These genomes are assigned to…

PlantTribes
Dataset