Unlock your biological data


Try: RNA sequencing CRISPR Genomic databases DESeq

1 - 50 of 83 results
filter_list Filters
language Programming Language
build Technology
healing Disease
settings_input_component Operating System
tv Interface
computer Computer Skill
copyright License
1 - 50 of 83 results
RAST / Rapid Annotation using Subsystem Technology
star_border star_border star_border star_border star_border
star star star star star
forum (1)
Assists in annotating complete or nearly complete bacterial and archaeal genomes. RAST is a fully-automated application provides high quality genome annotations for these genomes across the whole phylogenetic tree. It includes a user interface that allows registered users to make manual changes to their genomes before retrieving them. It was designed to extend annotations to as many protein-encoding genes in as many genomes as possible.
star_border star_border star_border star_border star_border
star star star star star
A command line software tool to fully annotate a draft bacterial genome in about 10 min on a typical desktop computer. It produces standards-compliant output files for further analysis or viewing in genome browsers. Prokka uses parallel processing to decrease running time on multicore computers. The most time-consuming steps are BLAST+ and hmmscan, which both support multiple CPUs natively. However, Prokka is more efficient if it runs multiple single CPU threads on subsets of the data, which it achieves using GNU parallel.
Supports systematic and efficient revision of microbial genome annotation, data management and comparative analysis. MicroScope allows users to analyze microbial (meta)genomes together with post-genomic experiment results if any (i.e. transcriptomics, re-sequencing of evolved strains, mutant collections, phenotype data). It combines tools and graphical interfaces to analyze genomes and to perform the expert curation of gene functions in a comparative context. MicroScope contains data for more than 6000 microbial genomes, and among the 2700 personal accounts, 14% of the users are performing expert annotations, on at least a weekly basis, contributing to improve the quality of microbial genome annotations.
star_border star_border star_border star_border star_border
star star star star star
Allows integrative investigation of next generation sequencing (NGS) microbiology data. Orione supports the whole life cycle of microbiology research data from production and annotation to publication and sharing. It can be used for a variety of microbiological projects including bacteria resequencing, de novo assembling and microbiome investigations. This tool is implemented on the Galaxy web platform.
PGAP / Prokaryotic Genome Annotation Pipeline
An automatic prokaryotic genome annotation pipeline that combines ab initio gene prediction algorithms with homology based methods. Combining the best features of the pan-genome approach in highly abundant clades with well-described and well-tested ab initio methods, PGAP now presents a flexible and extensible framework for prokaryotic annotation needs. The PGAP pipeline is designed to annotate both complete genomes and draft genomes comprising multiple contigs. PGAP is deeply integrated into NCBI infrastructure and processes, and uses a modular software framework, GPipe, developed at NCBI for execution of all annotation tasks, from fetching of raw and curated data from public repositories (the Sequence and Assembly databases) through sequence alignment and model-based gene prediction, to submission of annotated genomic data to public NCBI databases.
star_border star_border star_border star_border star_border
star star star star star
Assigns K numbers to the user's sequence data by BLAST searches, respectively, against a nonredundant set of KEGG GENES. KOALA (KEGG Orthology And Links Annotation) is KEGG's internal annotation tool for K number assignment of KEGG GENES using SSEARCH computation. Annotate Sequence in KEGG Mapper and Pathogen Checker in KEGG Pathogen are special interfaces to this server and can be executed in an interactive mode. BlastKOALA is suitable for annotating fully sequenced genomes.
Provides a more reliable annotation of both protein-coding and non-coding genes. Genix is an automated pipeline for bacterial genome annotation. This server performs the annotations one-by-one, so the time needed to annotate your genome depends not only on your sequence and the parameters you set to generate the protein database, but also on the server usage. A comparison of the results generated by Genix for four reference genomes against those generated by other annotation tools indicated that this pipeline is able to provide results that are closer to the reference genome annotation, with a smaller amount of false-positive proteins and missing functional annotated proteins. These results indicate that Genix is a useful tool that is able to provide a more refined result, and may be a user-friendly way to obtain high quality results.
MaGe / Magnifying Genomes
A microbial genome annotation system based on a relational database containing information on bacterial genomes, as well as a web interface to achieve genome annotation projects. This system allows one to initiate the annotation of a genome at the early stage of the finishing phase. MaGe's main features are (i) integration of annotation data from bacterial genomes enhanced by a gene coding re-annotation process using accurate gene models, (ii) integration of results obtained with a wide range of bioinformatics methods, among which exploration of gene context by searching for conserved synteny and reconstruction of metabolic pathways, (iii) an advanced web interface allowing multiple users to refine the automatic assignment of gene product functions.
BASys / Bacterial Annotation System
A web server that performs automated, in-depth annotation of bacterial genomic (chromosomal and plasmid) sequences. BASys uses more than 30 programs to determine nearly 60 annotation subfields for each gene, including gene/protein name, GO function, COG function, possible paralogues and orthologues, molecular weight, isoelectric point, operon structure, subcellular localization, signal peptides, transmembrane regions, reactions, and pathways. The textual annotations and images that are provided by BASys can be generated in approximately 16 hours for an average bacterial chromosome (5 Megabases. 5000 genes), or approximately 350 coding regions per hour.
RATT / Rapid Annotation Transfer Tool
Helps in genome annotation. RATT is a software that allows users to transfer any entries from a reference sequence to similar samples. It can be applied to successive versions of a genome assembly, genomes of closely related species or strains. In addition, the software is also able to detect dissimilarities between two sequences and to generate inputs making genomes’ features to be visualized with Artemis. The software is a part of PAGIT toolkit.
AMIGene / Annotation of MIcrobial Genes
An application for automatically identifying the most likely coding sequences (CDSs) in a large contig or a complete bacterial genome sequence. The first step in AMIGene is dedicated to the construction of Markov models that fit the input genomic data (i.e. the gene model), followed by the combination of well-known gene-finding methods and an heuristic approach for the selection of the most likely CDSs. The web interface allows the user to select one or several gene models applied to the analysis of the input sequence by the AMIGene program and to visualize the list of predicted CDSs graphically and in a downloadable text format.
PAGIT / Post-Assembly Genome-Improvement Toolkit
Provides a toolkit for improving the quality of genome assemblies created via an assembly software. PAGIT compiled four tools: (i) ABACAS which classifies and orientates contigs and estimates the sizes of gaps between them; (ii) IMAGE uses paired-end reads to extend contigs and close gaps within the scaffolds; (iii) ICORN for identifying and correcting small errors in consensus sequences and; (iv) RATT for help annotation. The software was mainly created to analyze parasite genomes of up to about 300 Mb.
FLAN / FLu ANnotation
Automatically annotates genomes of influenza virus A and B based on existing protein sequences in GenBank. For each segment, a set of sample protein sequences is maintained on the server. A special global protein-to-nucleotide alignment tool was designed to accurately annotate spliced genes and mature peptides of influenza viruses. The translated product from the best alignment to the sample protein sequence is used as the predicted protein encoded by the input sequence.
MICheck / MIcrobial genome Checker
Enables rapid verification of sets of annotated genes and frameshifts in previously published bacterial genomes. The web interface allows one easily to investigate the MICheck results, i.e. inaccurate or missed gene annotations: a graphical representation is drawn, in which the genomic context of a unique coding DNA sequence annotation or a predicted frameshift is given, using information on the coding potential (curves) and annotation of the neighbouring genes.
A hybrid convolutional and bi-directional long short-term memory recurrent neural network framework for predicting non-coding function de novo from sequence. In the DanQ model, the convolution layer captures regulatory motifs, while the recurrent layer captures long-term dependencies between the motifs in order to learn a regulatory ‘grammar’ to improve predictions. DanQ improves considerably upon other models across several metrics. For some regulatory markers, DanQ can achieve over a 50% relative improvement in the area under the precision-recall curve metric compared to related models.
Companion / COMprehensive Parasite ANnotatION
A web server providing parasite genome annotation as a service using a reference-based approach. Companion delivers a readily usable annotation of features in the target genome, in a variety of different formats including those required for submission to public databases. Moreover, it implements several features to highlight gene content differences between the reference and the new assembly, such as identification of orthologous clusters, species-specific singleton genes and missing core genes present in a larger reference species set. To recognize misassemblies or rearrangements, it also provides a high level visualization of sequence matches.
GAMOLA / Global Annotation of Multiplexed On-site bLasted DNA-sequences
Allows microbiologists to work with and curate draft and completed genomes. GAMOLA offers a completely local solution for annotating genomes that can tracks open reading frames (ORFs) designations and Blast results through new sequence versions. It also designs oligonucleotides for polymerase chain reaction (PCR) products to be spotted on whole genome microarrays, including an optional BlastN analysis to find potential mispriming sets.
BUSCO / Benchmarking Universal Single-Copy Orthologs
Provides measures for quantitative assessment of genome assembly, gene set, and transcriptome completeness based on evolutionarily informed expectations of gene content from near-universal single-copy orthologs selected from OrthoDB. BUSCO assessments are implemented in open-source software, with comprehensive lineage-specific sets of benchmarking universal single-copy orthologs for arthropods, vertebrates, metazoans, fungi, eukaryotes, and bacteria.
Provides parsers to process annotation data from LocusLink, Gene Ontology Consortium, and Human Gene Project and can be extended to new data sources via user defined parsers. AnnBuilder is an R package for assembling genomic annotation data differs from other existing systems in that it provides users with unlimited ability to assemble data from user selected sources. Annbuilder can perform a variety of roles. It can be used by an institution or lab to assemble data from a variety of sources into a more suitable format for the specific analyses being performed. Manufacturers of data such as Gene Expression Omnibus and NetAffx could also employ Annbuilder as a tool to help assemble the data they present.
A fully automated pipeline for structural annotation of prokaryotic genomes integrating protein similarities, statistical information and any oriented expression information (RNA-seq or tiling arrays) through a variety of file formats to produce a qualitatively enriched annotation including coding regions but also (possibly antisense) non-coding genes and transcription start sites. EuGene-PP exploits a variety of information sources, under most usual formats, to produce an annotation comparable with a curated semiautomated structural annotation, especially on ncRNA genes, which are still difficult to predict.
A prokaryotic genome annotation framework that performs intrinsic gene predictions, homology searches, predictions of non-coding genes as well as CRISPR repeats and integrates all evidence into a consensus annotation. ConsPred achieves comprehensive, high-quality annotations based on rules and priorities, similar to decision-making in manual curation and avoids conflicting predictions. Parameters controlling the annotation process are configurable by the user. ConsPred is primarily useful for the annotation of finished genome sequences or high-quality genome drafts (e.g., for submission to public databases), and for genome re-annotation in comparative genomics and functional genomics projects of prokaryotes.
SHAP / Simple High-throughput Annotation Pipeline
A lightweight and scalable sequence annotation pipeline capable of supporting research efforts that generate or utilize large volumes of DNA sequence data. SHAP provides Grid capable analysis, relational storage and Web-based full-text searching of annotation results. The software recognizes the limited resources of many smaller research groups. SHAP does not expect any prerequisite analysis tools be installed or any particular analyser be defined prior to use, but does provide a set of example definitions. In this way, this tool can be considered both a framework and a skeletonized pipeline ready to be fleshed-out by the user.
FARAO / Flexible All-Round Annotation Organizer
A set of software tools that constitute as a highly configurable way to handle sequence annotation and coverage information. FARAO 1) integrates annotation and coverage information for the same sequence set; 2) is scalable to millions of sequences and features; 3) can filter out sequences with annotations satisfying criteria given by the user; 4) handles annotations produced by a range of bioinformatics tools; and 5) provides a flexible interface for writing custom parsers for virtually any format not supported out of the box.
Sma3s / Sequence massive annotator using 3 modules
An easy-to-use tool for high throughput annotation that provides both accuracy and broad applicability for different kinds of sequence datasets. It is especially focused on the massive annotation of sequences from any kind of gene library or genome, and provides high levels of prediction accuracy (higher than 80%) with minimal manual input and low computational resource requirements. It is composed of three modules that sequentially annotate unknown sequences with increasing difficulty. All three modules use a preliminary exhaustive BLAST search as their starting point, and together generate annotations that are both highly sensitive and specific. The 3 Sma3s modules annotate sequences from: (a) already existing annotated sequences; (b) orthologous sequences and (c) groups of sequences sharing statistically significant patterns. As a result Sma3s can obtain the following biological descriptors associated to each sequence: gene ontology terms (GO), Swiss-Prot keywords and pathways, and InterPro domains.
VAX / Variant Annotator eXtras
A scalable method for using the plugin capability of the Ensembl Variant Effect Predictor to enrich its basic set of variant annotations with additional data on genes, function, conservation, expression, diseases, pathways and protein structure, and describe an extensible framework for easily adding additional custom data sets. VAX consists of a locally installed MySQL database system, which hosts the Ensembl database and custom data used by the annotator, local installations of the Ensembl Perl API and VEP and a library of custom VEP plugins.
forum (1)
A versatile, easy-to-use online automated tool for annotation of large and complex genomes. Its modular architecture allows for the annotation and masking of transposable elements, the structural, and functional annotation of protein-coding genes with an evidence-based quality indexing, and the identification of conserved non-coding sequences and molecular markers. The TriAnnot pipeline is parallelized on a 712 CPU computing cluster that can run a 1-Gb sequence annotation in less than 5 days. It is accessible through a web interface for small scale analyses or through a server for large scale annotations.
An easy-to-use web service for integrated genome annotation. MEGANTE has a simple interface to submit query sequences and performs genome annotation including repeat masking, transcript and protein alignments, computational gene prediction, similarity searches to known proteins, functional domain searches, and assignment of GO terms to the predicted genes. Multiple lines of evidence for the predictions are integrated, and an appropriate consensus exon-intron structure is selected for each locus. MEGANTE provides the results of the annotation in Microsoft Excel format and a graphical view based on a widely used genome browser, GBrowse. All data including the query sequences and annotation results are stored at the server end, enabling users to access their own data virtually from anywhere through web access. MEGANTE can accept genome sequences of 31 species in 10 families of plants.
GATU / Genome Annotation Transfer Utility
Transfers annotations from a reference genome to a closely related target genome, while still giving the user final control over which annotations should be included. GATU detects open reading frames (ORFs) present in the target but not the reference genome and provides the user with a variety of bioinformatics tools to quickly determine if these ORFs should also be included in the annotation. It greatly simplifies the initial stages of genome annotation by using a closely related genome as a reference.
Assigns functional annotations. FastAnnotator efficiently annotates sequences with their gene functions, enzyme functions or domains. It is useful in transcriptome studies and especially for those focusing on non-model organisms or metatranscriptomes. The tool integrates several well-developed annotation tools together to provide annotations for query sequences. It is capable of efficiently annotating sequences and is suitable for annotation of sequences derived from less well-studied organisms or environmental samples.
Provides a set of over 30 tools to assist researchers in the exploration of genomics and proteomics datasets. Dintor covers a wide range of frequently required functionalities, from gene identifier conversions and orthology mappings to functional annotation of proteins and genetic variants up to candidate gene prioritization and Gene Ontology-based gene set enrichment analysis. A major advantage is its capability to consistently handle multiple versions of tool-associated datasets, supporting the researcher in delivering reproducible results.
EuCAP / Eukaryotic Community Annotation Package
Incorporates structural and functional annotation from the community into an on-going annotation effort. EuCAP integrates the community annotators (CA) annotation in the current project annotation and generate web pages. The tool was applied to the rice genome. The CA models are aligned to the rice pseudomolecules and the coordinates of these alignments, along with functional annotation, are stored in the MySQL EuCAP Gene Model database. Web pages displaying the alignments of the CA models to the Osa 1 Genome models are automatically generated from the EuCAP Gene Model database. The alignments are reviewed by the project annotators in the context of experimental evidence.
GASS / Genome Annotation based on Species Similarity
A fast and easy-to-use computational tool to annotate a eukaryotic genome based on only the annotations from another similar species. Based on the alignments from the exons’ sequences of annotated species to the un-annotated genome, GASS detects the optimal transcript annotations with a shortest-path dynamic programming model. For example, GASS is applied to achieve the rhesus annotations based on its 97.5% similarity with the human genome in 5 days Blast running and 1.5 hours GASS running.
1 - 12 of 12 results
filter_list Filters
computer Job seeker
Disable 4
person Position
thumb_up Fields of Interest
public Country
language Programming Language
1 - 12 of 12 results

By using OMICtools you acknowledge that you have read and accepted the terms of the end user license agreement.