Single nucleotide variant identification software tools | High-throughput sequencing data analysis
With the advent of relatively affordable high-throughput technologies, DNA sequencing of cancers is now common practice in cancer research projects and will be increasingly used in clinical practice to inform diagnosis and treatment. Somatic (cancer-only) single nucleotide variants (SNVs) are the simplest class of mutation, yet their identification in DNA sequencing data is confounded by germline polymorphisms, tumour heterogeneity and sequencing and analysis errors.
Integrates workflow technology and in-built access to bioinformatics resources including remote data warehouses and tools. Galaxy permits users without programming skills to conduct computational analysis through the Web. It builds a succession of tools to perform multistep studies and is able to conserve the complete provenance of each analysis step. This platform offers drag and drop functionalities to ease the construction of workflows.
Assists users with DNA sequence analysis. Sequencher is a graphical application that integrates the Cufflinks suite for in-depth transcript analysis and differential gene expression of RNA-Seq data. It can generate unique data visualizations using customized plots and graphs that produces ready-to-print graphs. This model also includes an extensive Sanger analysis features.
A platform-independent mutation caller for targeted, exome, and whole-genome resequencing data generated on Illumina, SOLiD, Life/PGM, Roche/454, and similar instruments. The newest version, VarScan 2, is written in Java, so it runs on most operating systems. It can be used to detect different types of variation: 1) germline variants (SNPs and indels) in individual samples or pools of samples, 2) multi-sample variants (shared or private) in multi-sample datasets (with mpileup), 3) somatic mutations, LOH events, and germline variants in tumor-normal pairs and 4) somatic copy number alterations (CNAs) in tumor-normal exome data.
Allows users to interact with high-throughput sequencing data. SAMtools permits the manipulation of alignments in the SAM/BAM/CRAM formats: reading, writing, editing, indexing, viewing and converting SAM/BAM/CRAM format. It limits the mapping quality of reads with excessive mismatches and applies base alignment quality to fix alignment errors. This tool can sort and merge alignments, remove polymerase chain reaction (PCR) duplicates or generate per-position information.
Provides various next-generation sequencing (NGS) data analysis applications which are developed or optimized by Illumina, or from a growing ecosystem of third-party app providers. BAseSpace is a cloud platform that can be integrated with the industry’s leading sequencing platforms, without cumbersome or time consuming data transfer steps.
Focuses on variant discovery and genotyping. GATK provides a toolkit, developed at the Broad Institute, composed of several tools and able to support projects of any size. The application compiles an assortment of command line allowing one to analyze of high-throughput sequencing (HTS) data in various formats such as SAM, BAM, CRAM or VCF. The website includes multiple documentation for guiding users.