FastQC specifications

Information


Unique identifier OMICS_01043
Name FastQC
Software type Application/Script
Interface Command line interface, Graphical user interface
Restrictions to use None
Input format FASTQ
Output format HTML
Biological technology Illumina, Pacific Biosciences, Roche
Operating system Unix/Linux, Mac OS, Windows
Programming languages Java
License GNU General Public License version 3.0, GNU General Public License version 2.0
Computer skills Advanced
Version 0.11.7
Stability Stable
Requirements Picard BAM/SAM,JRE
Maintained Yes

Download


Versioning


Add your version

Maintainer


  • person_outline Simon Andrews <>

Additional information


http://www.bioinformatics.babraham.ac.uk/projects/fastqc/Help/1%20Introduction/1.1%20What%20is%20FastQC.html

FastQC citations

 (318)
2018
PMCID: 5953949

[…] bioanalyzer (agilent) and sequenced using 50 bp paired-end sequencing on an illumina hiseq 2500 (illumina) by nyu genome technology center., atac-seq fastq files were quality trimmed using fastqc (v0.11.5) and mapped to the mouse genome (mm9) using bowtie (v2.2.6)67, using the default options. mapped sam files were sorted and converted to bam files with samtools (v0.1.19-96b5f2294a)56. […]

2018
PMCID: 5946048

[…] fisher scientific, waltham, usa). the library was sequenced using paired-end sequencing on an illumina miseq platform (illumina, san diego, ca, usa). the quality of reads was checked with the fastqc tool. de novo assembly was performed with clc genomics workbench (length fraction, 0.5; similarity fraction, 0.8). annotation of the sequence was performed with the ncbi prokaryotic genome […]

2018
PMCID: 5952427

[…] were then amplified for 15 cycles using primers incorporating unique index tags. fragments were sequenced on an illumina hiseq-3000 using single reads extending 50 bases. samples were qc’d using fastqc, aligned to mm10 using star-align, and counted using htseq-count. technical replicates were collapsed in rstudio and differential expression determined using deseq2. adjusted p < 0.05 […]

2018
PMCID: 5930951

[…] in ~ 21.5 million reads per cell line. the five clonally expanded s16 cells were only sequenced once. read quality was assessed using fastqc (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/) and reads were aligned to the rat rnor_5.0 (ensembl) reference genome using star [44]. default parameters were used, except only uniquely mapped reads were allowed (~ 82% of total reads […]

FastQC institution(s)
Babraham Institute, Cambridge, UK

FastQC reviews

 (2)
star_border star_border star_border star_border star_border
star star star star star

Vijay Lakhujani

star_border star_border star_border star_border star_border
star star star star star
Desktop
I am using FastQC on a daily basis for a number of samples obtained using sequencing run. I am writing this review after 3.5 years of usage.

1. Very simple tool
I must say this is a fairly simple tool used during the first step of ngs analysis i.e. quality check. A series of tests are perfomed and each test is flagged as pass/warning/fail which depends on how far it departs from the ideal expectations.It is possible that the biological. With not too many options in the UI version, its intuitive to use and work with. Even the command line version is fairly simple.

2. Support for variety of platforms
The tools supports NGS data(fatsq) files generated on many plaforms such Illumina-HiSeq, MiSeq, pacbio, 454 etc. This makes it number one choice to get a quick assesment of quality.

3. Quality check on multiple aspects:
The tool generates colored reports on 8-10 different quality assesment metrics such as read length distribution, quality distribution, per base GC content, adapter contamination level, kmer distibution etc. This provides a global picture of the complete quality of data.

4. Automation Level:
The tools is available in 2 modes - intractive and command line. THe interactive mode allows to view results for multiple files in a single
application. Alternatively, the non interactive mode(command line) generates an HTML report for each file processed.The command line version provides a fantastic utility to automate the analysis on multiple samples (paired and unpaired fastq files) at once. User can define the number of cores using the -t option for no. of threads. It also supports wild card character example the '*' for subsetting the fastq files based on their names or extension.

5. Development, support and documentation:
The tools is well maintained and being constantly developed for bug fixes and new features. It is supported on windows,linux and mac. The latest vertsion v11.5 was released on 08-03-16. Release notes are well maintained and could be found at:
https://www.bioinformatics.babraham.ac.uk/projects/fastqc/RELEASE_NOTES.txt

A well written documentation on the analysis modules is available at the below link:
https://www.bioinformatics.babraham.ac.uk/projects/fastqc/Help/

Bugs can be tracked or reported on bugzilla at the below link
http://www.bioinformatics.babraham.ac.uk/bugzilla/buglist.cgi?quicksearch=fastqc

6. Installation
Installation is quite easy for a person having intermediate knowledge of working on linux. A well written installation documentation is available at below link:
https://www.bioinformatics.babraham.ac.uk/projects/fastqc/INSTALL.txt

Requirements:
FastQC is a cross-platform application, written in java.
- java


Pros
- cross platform (runs on windows, linux and mac), user friendly
- interactive and command line mode
- comprehensive html summary reports
- suitable for data generated on variety of platforms
-


Cons
- Some of the metrics work on a subset of the overall data hence the interpretation may vary.

Arup Ghosh

star_border star_border star_border star_border star_border
star star star star star
Desktop
FastQC is one of the best tools for doing a quality check of any type of NGS. The tool provides HTML reports and able to process a batch of fastq files together.