1 - 16 of 16 results


CEL-Seq provides its first single-cell, on-chip barcoding method, and we detected gene expression changes accompanying the progression through the cell cycle in mouse fibroblast cells. The pipeline consists of the following steps: (1) demultiplexing: using the barcode from R1 we split R2 reads into their original samples creating a separate file for each sample. Since the unique molecular identifier (UMI) is also read in R1 we extract it and attach it to the R2 read metadata for downstream analysis; (2) mapping: using Bowtie2, we map the reads of the different samples in parallel, cutting the analysis time by roughly the number of available cores; (3) read counting: A modified version of the htseq-count script that supports the identification and elimination of reads sharing the same UMI to generate an accurate molecule count for each feature. We use binomial statistics to convert the number of UMIs into transcript counts. The different steps in the pipeline are wrapped together in a single program with a simple configuration file allowing to control for different run modes.

UMI-tools / Unique Molecular Identifiers-tools

Demonstrates the value of properly accounting for errors in unique molecular identifiers (UMIs). UMI-tools removes PCR duplicates and implements a number of different UMI deduplication schemes. It can extract, remove and append UMI sequences from fastq reads. Compared with previous method, this one is superior at estimating the true number of unique molecules. The simulations provide an insight into the impact on quantification accuracy and indicate that application of an error-aware method is even more important with higher sequencing depth.


Allows quality control (QC) and analysis components of parallel single cell transcriptome and epigenome data. Dr.seq is a quality control (QC) and analysis pipeline that provides both multifaceted QC reports and cell clustering results. Parallel single cell transcriptome data generated by different technologies can be transformed to the standard input with contained functions. Using relevant commands, the software can also be used to report quality measurements based on four aspects and can generate detailed analysis results for scATAC-seq and Drop-ChIP datasets.


Offers tools to address most barcoding situations with and without unique molecular identifier (UMI) and the identification of polymerase chain reaction (PCR) duplicates based on extracted UMIs. In standard experimental set ups (one barcode per sample, identical barcodes at both fragments’ ends) and using equivalent options, Je demultiplex produced identical results when compared to other demultiplexing tools and performed 3.8 times faster and 4.5 times slower than the popular FASTX and eautils packages.


Processes raw reads to count tables for RNA-seq data using Unique Molecular Identifiers (UMIs). zUMIs is a pipeline applicable for most experimental designs of RNA-seq data, such as single-nuclei sequencing techniques. This method allows for down sampling of reads before summarizing UMIs per feature, which is recommended for cases of highly different read numbers per sample. zUMIs is flexible with respect to the length and sequences of the barcodes (BCs) and UMIs, making it compatible with a large number of protocols.

UMIS / Unique Molecular IdentifierS

Provides a way of removing amplification biases, the assumed absolute quantification does not appear to hold true perfectly. Umis is a flexible tool for counting the number of unique molecular identifiers. There are four steps in this method: (i) formatting reads, (ii) filtering noisy cellular barcodes, (iii) pseudo-mapping to cDNAs, and (iv) counting molecular identifiers. The quantitation used in umis handles reads that could come from multiple transcripts by assigning a fractional count to each transcript and then filtering for a minimum count at the end.