1 - 21 of 21 results

GATK / Genome Analysis ToolKit

star_border star_border star_border star_border star_border
star star star star star
Focuses on variant discovery and genotyping. GATK provides a toolkit, developed at the Broad Institute, composed of several tools and ables to support projects of any size. The application compiles an assortment of command line allowing one to analyze of high-throughput sequencing (HTS) data in various formats such as SAM, BAM, CRAM or VCF. The website includes multiple documentation for guiding users.


star_border star_border star_border star_border star_border
star star star star star
Allows users to interact with high-throughput sequencing data. SAMtools permits the manipulation of alignments in the SAM/BAM/CRAM formats: reading, writing, editing, indexing, viewing and converting SAM/BAM/CRAM format. It limits the mapping quality of reads with excessive mismatches and applies base alignment quality to fix alignment errors. This tool can sort and merge alignments, remove polymerase chain reaction (PCR) duplicates or generate per-position information.


star_border star_border star_border star_border star_border
star star star star star
A software suite for the comparison, manipulation and annotation of genomic features in browser extensible data (BED) and general feature format (GFF) format. BEDTools also supports the comparison of sequence alignments in BAM format to both BED and GFF features. The tools are extremely efficient and allow the user to compare large datasets (e.g. next-generation sequencing data) with both public and custom genome annotation tracks. BEDTools can be combined with one another as well as with standard UNIX commands, thus facilitating routine genomics tasks as well as pipelines that can quickly answer intricate questions of large genomic datasets.

GATK-Queue / Genome Analysis Toolkit-Queue

A command-line scripting framework for defining multi-stage genomic analysis pipelines combined with an execution manager that runs those pipelines from end-to-end. Often processing genome data includes several steps to produces outputs, for example our BAM to VCF calling pipeline include among other things: local realignment around indels; emitting raw SNP calls; emitting indels, masking the SNPs at indels; annotating SNPs using chip data; labeling suspicious calls based on filters; creating a summary report with statistics. Running these tools one by one in series may often take weeks for processing, or would require custom scripting to try and optimize using parallel resources. With a Queue script users can semantically define the multiple steps of the pipeline and then hand off the logistics of running the pipeline to completion. Queue runs independent jobs in parallel, handles transient errors, and uses various techniques such as running multiple copies of the same program on different portions of the genome to produce outputs faster.


Examines epigenomic and transcriptomic next generation sequencing (NGS) data. Octopus-toolkit can be used for antibody- or enzyme-mediated experiments and studies for the quantification of gene expression. It can accelerate the data mining of public epigenomic and transcriptomic NGS data for basic biomedical research. This tool provides a private and a public mode: one to process the user’s own data, and the other to analyze public NGS data by retrieving raw files from the GEO database.


A high performance robust tool and library for working with SAM, BAM and CRAM sequence alignment files; the most common file formats for aligned next generation sequencing (NGS) data. Sambamba is a faster alternative to samtools that exploits multi-core processing and dramatically reduces processing time. Sambamba is being adopted at sequencing centers, not only because of its speed, but also because of additional functionality, including coverage analysis and powerful filtering capability.


Enables tightly integrated comparative variant analysis and visualization of thousands of next generation sequencing (NGS) data samples and millions of variants. BasePlayer is a highly efficient and user-friendly software for biological discovery in large-scale NGS data. It transforms an ordinary desktop computer into a large-scale genomic research platform, enabling also a non-technical user to perform complex comparative variant analyses, population frequency filtering and genome level annotations under intuitive, scalable and highly-responsive user interface to facilitate everyday genetic research as well as the search of novel discoveries.


Identifies regions with a bad, acceptable and good coverage in sequence alignment data available as bam files. BadRegionFinder offers various visual and textual types of output. The whole genome may be considered as well as a set of target regions. The tool can return detailed output files considering every base that is or should be covered and an overview file considering the coverage of the different genes that were targeted. The software may be used to determine user-definable, basewise quantiles over all samples at any position.

VarAFT / Variant Annotation and Filter Tool

Annotates and filtrates variant files. VarAFT allows the comparison of several individuals and the collection of relevant information about the variations. It includes a coverage analysis module to easily visualize regions that are poorly covered though tables and dynamic charts. With VarAFT, users can annote variant (VCF) files, combine multiple samples from various individuals, prioritize list of variants by multi-filtering parameters. Additionnaly, users can perform a coverage analysis and quality check from any BAM file.


A statistical method to analyse genomic sequences. BpMatch computes the coverage of a source sequence S on a target sequence T, by taking into account direct and reverse segments, eventually overlapped. BpMatch can be used for all the same applications for which transformation distance (TD) is used, with the advantages of a strong reduction of computational time-space complexity and the ability to consider overlapped segments. BpMatch could strongly improve the results of such a phylogenetic investigation.