1 - 50 of 53 results


A computational method to reconstruct full-length, paired T cell receptor (TCR) sequences from T lymphocyte single-cell RNA sequence data. TraCeR links T cell specificity with functional response by revealing clonal relationships between cells alongside their transcriptional profiles. TraCeR extracts TCR-derived sequencing reads for each cell by alignment against ‘combinatorial recombinomes’ comprising all possible combinations of V and J segments. Reads are then assembled into contiguous sequences that are analyzed to find full-length, recombined TCR sequences. Importantly, the reconstructed recombinant sequences typically contain nearly the complete length of the TCR V(D)J region and so allow high-confidence discrimination between closely related gene segments. Our method is sensitive, accurate and easy to adapt to any species for which annotated TCR gene sequences are available.


Enables the simulation of time-resolved antibody repertoires. AbSim is an R package that allows user to comprehensively control the vast biologically-relevant parameter space of antibody repertoire selection and evolution. The control of the following parameters is possible by: (i) total time of evolution, (ii) rate and method of somatic hypermutation (SHM), (iii) number and rate of recombination events, (iv) rate at which new sequences are produced, (v) baseline mutation rate, (vi) clonal frequency, and (vii) germline gene usage distribution.

RTCR / Recover TCR

A pipeline that accurately recovers T cell receptor (TCR) sequences, including rare TCR sequences, from high throughput sequencing data (including barcoded data) even at low coverage. RTCR employs a data-driven statistical model to rectify PCR and sequencing errors in an adaptive manner. Using simulations we demonstrate that RTCR can easily adapt to the error profiles of different types of sequencers and exhibits consistently high recall and high precision even at low coverages where other pipelines perform poorly. Using published real data we show that RTCR accurately resolves sequencing errors and outperforms all other pipelines.


Identifies germline V genes from expressed repertoires to a specificity of 100%. IgDiscover uses a cluster identification process to produce candidate sequences that, once filtered, results in individualized germline V gene databases. IgDiscover was tested in multiple species, validated by genomic cloning and cross library comparisons and produces comprehensive gene databases even where limited genomic sequence is available. IgDiscover analysis of the allelic content of the Indian and Chinese-origin rhesus macaques reveals high levels of immunoglobulin gene diversity in this species.


A software package that significantly improves the completeness and accuracy of TCR/IG profiling from deep sequence data and includes procedures to identify novel alleles of gene segments. The alignment step in LymAnalyzer, which is based on a fast-tag-searching algorithm, results in rapid identification of VDJ gene segments, with significantly improved accuracy and completeness compared to existing tools applied to TCR data. In addition, LymAnalyzer can be applied to IG sequences, includes an integrated single nucleotide polymorphism (SNP) calling algorithm that identifies novel alleles of the VDJ gene segments and produces lineage mutation trees to represent the affinity maturation process of the IGs. On real and simulated data sets LymAnalyzer produces highly accurate and complete results. Although, to date we have applied it to TCR/IG data from human and mouse, it can be applied to data from any species for which an appropriate database of reference genes is available.


A stand-alone pipeline primarily based in R programming for the integral analysis of B cell repertoire data generated by HTS. The pipeline integrates GNU software and in house scripts to perform quality filtering, sequencing noise correction and repertoire reconstruction based on V, D and J segment assignment, clonal origin and unique heavy chain identification. Post-analysis scripts generate a wealth of repertoire metrics that in conjunction with a rich graphical output facilitates sample comparison and repertoire mining. The performance of ImmunediveRsity was tested with raw and curated human and mouse 454-Roche sequencing benchmarks providing good approximations of repertoire structure. Although ImmunediveRsity is similar to other recently developed tools, it offers significant advantages that facilitate repertoire analysis and repertoire mining. ImmunediveRsity is open source and free for academic purposes and it runs on 64 bit GNU/Linux and MacOS.


A collection of tools for downstream analysis of Rep-Seq data, including clustering and phylogenetic analysis. TRigS assists with the determination and analysis of B-cell lineage trees from next-generation sequencing data. TRigS consists in various tools, including 1) AnnotateTree creates annotated lineage trees and sequence alignments showing the point at which amino acid substitutions occur 2) RevertToGermline uses a simple approach to infer the germline ancestor of a B-cell variable region sequence, given the IMGT junction analysis. 3) Clustering tools support the clustering of sequences for clonal analysis, and their large-scale depiction. 4) Tools for junction parsing and results manipluation support the integration of IgBLAST into an IMGT-style pipeline, and ease the processing of tab-separated analysis files.


Analyzes vast amounts of IGHV (immunoglobulin heavy-chain variable) sequences and exploring the resulting data. ImmuneDB can take as input raw FASTA/FASTQ data, identify genes, determine clones, construct lineages, as well as provide information such as selection pressure and mutation analysis. It uses an industry leading database, MySQL, to provide fast analysis and avoid the complexities of using error prone flat-files. After analysis, resulting data can then be easily visualized, queried, and exported through its web-based interface.


A dynamic programming approach to learn the distribution of rearrangement scenarios from large numbers of non-productive sequences in an efficient way. This approach is based on a Hidden Markov Models (HMM) formulation of the problem, and learns its parameters using a modified BaumWelch (BW) algorithm to avoid the full enumerations of all scenarios. We tested our software tool on sequence data for both the alpha and beta chains of the T cell receptor. To test the validity of our algorithm, we also generated synthetic sequences produced by a known model, and confirmed that its parameters could be accurately inferred back from the sequences. The inferred model can be used to generate synthetic sequences, to calculate the probability of generation of any receptor sequence, as well as the theoretical diversity of the repertoire. We estimate this diversity to be ≈ 1023 for human T cells. The model gives a baseline to investigate the selection and dynamics of immune repertoires.

SONAR / Software for the Ontogenic aNalysis of Antibody Repertoires

Annotates next-generation sequencing data, identifies transcripts in a lineage of interest, and tracks lineage development across multiple time points. SONAR is capable of automating both general repertoire analysis and specialized techniques for investigating specific lineages. It also generates figures, such as identity–divergence plots and longitudinal phylogenetic “birthday” trees, and provides interfaces to other programs such as DNAML and BEAST. This software provides a tool for the processing of large next-generation sequencing datasets and the ontogenic analysis of neutralizing antibody lineages.


Contributes self-developed plots for the descriptive comparative investigation of immune repertoires. ClonoPlot allows the visualization of observations in comparative B cell receptors (BCR) and T cell receptors (TCR) repertoire analysis. It removes any requirement for users to become familiar with the programming language R in order to create high-quality figures. The tool can be adapted by user to the latest developments in the field or incorporate additional features based on their existing in-house expertise.


Allows users to reconstruct the native T-cell receptors (TCR)αβ from single cell RNA-seq data of Ag-specific T cells and to link these with the gene expression profile of individual cells. VDJPuzzle enables analysis about TCR diversity and its relationship with the transcriptional profile of different clones. Moreover, single-cell transcriptome analysis can successfully distinguish Ag-specific T cell populations sorted directly from resting memory cells in peripheral blood and sorted after ex vivo stimulation. Moreover, it has been adapted for B-cell receptor (BCRs) and includes additional features to reliably characterizes somatic hypermutation (SHMs).

BASELINe / Bayesian estimation of Antigen-driven SELectIoN

A computational framework for Bayesian estimation of antigen-driven selection in immunoglobulin sequences based on the analysis of somatic mutation patterns. BASELINe represents a fundamental advance over previous methods by shifting the problem from one of simply detecting selection to one of quantifying selection. Along with providing a more intuitive means to assess and visualize selection, BASELINe allows comparative analysis between groups of sequences derived from different germline V(D)J segments.

IGoR / Inference and Generation Of Repertoires

Processes raw immune sequence reads from any source and learns unbiased statistics of recombination and somatic hypermutations. IGoR is a flexible computational method that outputs a whole list of potential recombination and hypermutation scenarios, with their corresponding likelihoods. It learns a context-dependent hypermutation model to identify hotspots, which allows for a comprehensive analysis of the mutational landscape of B cells receptor (BCR).


Performs error-correction of immunosequencing reads and uses mass spectra to validate the constructed antibody repertoires. IgRepertoireConstructor takes Illumina MiSeq paired end reads as an input and expect that each left and right reads overlap and cover variable region of antibody. IgRepertoireConstructor stitches paired-end reads and removes contaminations using alignment of stitched reads against Ig germline database. This procedure significantly improves quality of input reads and allows one to constructs reads covering variable region of antibody.


Provides a suite of utilities that cover a range of complex analysis tasks for immunoglobulin (Ig) repertoire sequencing data. Change-O is a suite of utilities that (i) processes the output of V(D)J alignment tools, (ii) assigns clonal clusters to Ig sequences and (iii) reconstructs germline sequences. It also offers applications to import data from the frequently used IMGT/HighV-QUEST tool and a set of utilities to perform basic database operations, such as sorting, filtering and modifying annotations.

TRIg / T-cell receptor antibody

Aligns non-regular T-cell receptor and immunoglobulin sequences. TRIg does alignments to the whole receptor gene instead of only to the coding regions. This tool applies a heuristic strategy and incorporates gene annotation to identify authentic alignments. TRIg correctly identified non-regularly recombined sequences, which could not be achieved by current programs. TRIg should benefit researches of immune system and improve the prediction of various immune diseases.


A flexible toolkit for the processing and analysis of antigen receptor repertoire sequencing data at single-cell level. The software combines bioinformatics tools for immunoglobulin sequence annotation with a relational database, where raw data and analysis results are stored and linked. sciReptor supports attribution of additional data categories such as cell surface marker expression or immunological metadata. Furthermore, it comprises a quality control module as well as basic repertoire visualization tools.


forum (1)
A complementary software suite that solves a wide range of RepSeq post-analysis tasks, provides a detailed tabular output and publication-ready graphics, and is built on top of a flexible API. The main aims of the VDJtools Project are: (i) Ensure consistency between post-analysis methods and results, (ii) Save the time of bioinformaticians analyzing RepSeq data, (iii) Create an API framework facilitating development of new RepSeq analysis applications, and (iv) Provide a simple enough command line tool so it could be used by immunologists and biologists with little computational background.


Utilizes re-alignment to identify V(D)J genes and alleles after common local alignment. A methodology is developed to correct the PCR and sequencing errors, and to minimize the PCR bias among various rearranged sequences with different V and J gene families. IMonitor provides general adaptation for sequences from all receptor chains of different species and outputs useful statistics and visualizations. Usefulness of IMonitor was demonstrated on minimal residual disease detection of patients with B-cell Acute Lymphoblastic leukemia.


A pipeline to identify clonal lymphocyte populations from paired-end RNA sequencing reads, with a primary focus on sequencing mRNA from neoplastic cells. VDJSeq-Solver detects the main clone for the tissue of interest by capturing the most abundant V(D)J junction among the existing ones. It is able to characterize the V(D)J rearrangement by obtaining the exact sequence of the main clone. VDJSeq-Solver overcomes limitations of currently available lymphocyte rearrangements recognition methods that are designed to work with amplicon sequencing data and perform badly in case of irregular junction coverage. It was shown to correctly identify the main clone in case of five Mantle Cell Lymphoma samples and was further successfully tested on twelve Diffuse Large B-Cell Lymphoma samples.


Assists users in automating analysis of the generated next-generation-sequencing-spectratyping (NGS-S) data. TCRProfiler is a software that allows users (i) to determine, for each T cell receptor beta locus (TRB) chain analyzed, the exact length and sequence of the CDR3, (ii) to identify the rearranged TRBV (variable), TRBD (diversity) and TRBJ (joining) genes, (iii) to analyze nontemplate nucleotides added at the junction sites, and (iv) to exclude non-functional transcripts.


A software package to identify and analyze antigen receptors from high-throughput sequence libraries. ClonotypeR is designed to process, organize and analyze very large numbers of sequences, in the order of millions, typically produced by Roche 454 or Illumina instruments, and is made of two parts. The first contains shell scripts and reference segment sequences to produce a data file where each line represents the detection of a clonotype in a sequence read. The second part is a R module available from Bioconductor, to load and filter the data, and prepare clonotype abundance tables ready for analysis with third-party tools for differential representation analysis, sample clustering, etc.


Analyzes mutation patterns in Rep-Seq data to identify novel V segment alleles, and also constructs a personalized germline database containing the specific set of alleles carried by a subject. This information is then used to improve the initial V segment assignments from existing tools, like IMGT/HighV-QUEST. The application of TIgGER to Rep-Seq data from seven subjects identified 11 novel V segment alleles, including at least one in every subject examined. These novel alleles constituted 13% of the total number of unique alleles in these subjects, and impacted 3% of V(D)J segment assignments. These results reinforce the highly polymorphic nature of human Ig V genes, and suggest that many novel alleles remain to be discovered. The integration of TIgGER into Rep-Seq processing pipelines will increase the accuracy of V segment assignments, thus improving B-cell repertoire analyses.

IGGalaxy / ImmunoGlobulin Galaxy

A convenient web based application for analyzing next-generation sequencing results and reporting IGH gene rearrangements for both repertoire and clonality studies. IGGalaxy has two analysis options one using the built in igBLAST algorithm and the second using output from IMGT; in either case repertoire summaries for the B-cell populations tested are available. IGGalaxy supports multi-sample and multi-replicate input analysis for both igBLAST and IMGT/HIGHV-QUEST. IGGalaxy is built on top of the Galaxy framework (http://galaxyproject.org/) and is distributed as a virtual machine image.