An accurate somatic mutation detection pipeline implementing a stochastic boosting algorithm to produce highly accurate somatic mutation calls for both single nucleotide variants and small insertions and deletions. The workflow currently incorporates five state-of-the-art somatic mutation callers, and extracts over 70 individual genomic and sequencing features for each candidate site. A training set is provided to an adaptively boosted decision tree learner to create a classifier for predicting mutation statuses.
(Spencer et al., 2014) Performance of common analysis methods for detecting low-frequency single nucleotide variants in targeted next-generation sequence data. The Journal of molecular diagnostics.
(Stead et al., 2013) Accurately identifying low-allelic fraction variants in single samples with next-generation sequencing: applications in tumor subclone resolution. Human mutation.
(Roberts et al., 2013) A comparative analysis of algorithms for somatic SNV detection in cancer. Bioinformatics.
(Wang et al., 2013) Detecting somatic point mutations in cancer genome sequencing data: a comparison of mutation callers. Genome medicine.