Next-generation sequencing experiments have generated large amounts of data that are collected in public archives (primary databases). Secondary data extracted from these primary databases are available in the gene expression and disease/disorder sections.
Provides high-throughput microarray and next-generation sequence (NGS) functional genomic data sets. GEO archives raw data, processed data and metadata submitted by the research community. Its data are indexed, cross-linked and searchable. This database gives access to several tools and graphical renderings allowing users to easily explore and interpret data available on the platform. It can be useful to develop and test new hypotheses.
Enables the integrative analysis of aggregated collections of tagged gene expression signatures identified and extracted from GEO. GEN3VA can be used to identify, aggregate, and analyze themed collections of gene expression signatures from diverse but related studies. It aggregates 244 studies from young and old tissues in mammalian systems.
Provides a data repository for archiving raw sequence data. GSA is built based on INSDC data standards, structures and provides data archival services for scientific communities. This resource accepts raw sequence reads produced by a variety of sequencing platforms, stores both sequence reads and metadata, and provides free and unrestricted access to all publicly available data for worldwide scientific communities.
Gathers next generation sequencing (NGS) information. Enseqlopedia enables users to locate and contact NGS facilities around the world. It offers a collection of more than 380 NGS approaches and allows users to share their data on the web platform.
Provides a database for annotation/re-coding of sample-specific metadata from the Sequence Read Archive via biomedical ontologies. MetaSRA organizes and labels biological samples with information from the Disease Ontology, Experimental Factor Ontology, Cell Ontology, Cellosaurus and Uberon. Each sample raw metadata is annotated by using a custom automated computational pipeline. It encodes the metadata for each sample with a schema influenced by the one utilized on the ENCODE project.
Offers a repository for related with nucleotide sequencing workflows. ENA provides data model containing input information, output machine data and interpreted information. The database gathers a wide range of information as well as raw sequence data and derived data, including sequences, assemblies and functional annotation accompanied by studies and samples, to provide experimental context.