1 - 48 of 48 results


Allows to work about the history of recombination events that affected a given sample of bacterial genomes. ClonalOrigin run on as comparative analysis of sequences of a sample of bacterial genomes and enables to reconstruct the recombination events that have taken place in their ancestry. It contains an algorithm which permits to perform inference under this model from sequence data alignments and demonstrates that through parallelization, the inference is conceivable for whole-genome alignments.

P4P / Peptidomes for Phylogenies

Enables the in silico inference of bacterial taxonomy through the analysis of peptidomes. The P4P methodology uses whole genome data and in silico protein digestion to infer bacterial taxonomy, namely at the species and subspecies levels. The principal purpose is to generate a valid and manageable list of peptides that are potentially specific to each strain. This tool can also support accurate phylogenetic reconstruction for conventionally challenging groups of organisms.


Provides a git-based data store for archiving and curating phylogenetic estimates of species relationships. By incorporating curation into the data storage, Phylesystem have lowered the activation cost of entering data into an archive while also allowing continued curation, whether by the original authors or researchers interested in re-using these data, to improve the associated metadata. Phylesystem is part of The Open Tree of Life project and allows to complement systems by storing phylogenetic statements and associated metadata in a consistent format while retaining the history of edits that were made to the data themselves.


Uses maximum likelihood inference to simultaneously detect recombination in bacterial genomes and account for it in phylogenetic reconstruction. ClonalFrameML can analyse hundreds of genomes in a matter of hours. It implements the model underlying the popular Bayesian ClonalFrame approach in a computationally efficient manner, and we have demonstrated its ability to estimate recombination parameters and detect importation events in the context of understanding short-term transmission dynamics and long-term bacterial evolution.


Permits inspection of the topology and other aspects of large sets of phylogenetic trees in a phylogenomic context. PhyloSort is able to quantify the number of genes or trees that support different evolutionary hypotheses such as a taxonomic classification or to quantify endosymbiotic or horizontal gene transfer events. It can be useful to search for vertically inherited phylogenetic markers by testing characteristics in individual gene trees of members of each of the putative supergroups.

SPIMAP / Species Informed Maximum A Posteriori Gene Tree Reconstruction

Provides a phylogenetic reconstruction method specifically designed for reconstructing gene trees in the case of a known species tree. SPIMAP uses a Bayesian framework to model sequence evolution, gene duplication, loss, and substitution rate variation, thus incorporating many disparate types of information in a principled way. This method models rate variation that is correlated across all branches of the tree (gene-specific rate) as well as rates specific to each species lineage (species-specific rates). When both these effects are modeled, the result is a more informative prior which leads to increased reconstruction accuracy.

SCOTTI / Structured COalescent Transmission Tree Inference

Models each host as a distinct population, and transmissions between hosts as migration events. This phylogenetic software can (i) infer transmission events, (ii) account for the uncertainty associated with the possible presence of non-sampled hosts and (iii) use data from multiple samples of the same host. SCOTTI not only accounts for diversity and evolution within a host, but also for other sources of bias, namely non-sampled hosts and multiple infections of the same host. SCOTTI addresses the urgent need for software to analyse genomic and epidemiological data while accommodating for incomplete or patchy host sampling, mixed infections, and within-host variation. For these reasons, this method can help to reconstruct transmission histories in a broad range of outbreaks, both bacterial and viral.

MitoPhAST / Mitogenome Phylogenetic Analysis of Sequences Tool

Identifies annotated protein-coding gene features, generates a maximum likelihood phylogenetic tree and reports various mitochondrial genes and sequence information in a table format. The recovered phylogenetic trees using both Bayesian and ML methods support the results of studies using fragments of mtDNA and nuclear markers and other smaller-scale studies using whole mitogenomes. In comparison to the fragment-based phylogenies, nodal support values are generally higher despite reduced taxon sampling suggesting there is value in utilising more fully mitogenomic data.


Assists is carrying out and teaching meta-analysis in Ecology and Evolutionary biology (E&E). OpenMEE was developed to make advanced methods for statistical research synthesis, based on best practices, available without cost to the scientific community by providing an intuitive graphical user interface (GUI) to the diverse and growing statistical functionalities of the R ecosystem. Its interface also guides users to build appropriate synthesis models that provide high-quality analyses for the most common and important ecological questions.

HiMAP / Highly Multiplexed Amplicon-based Phylogenomics

Produces phylogenomic datasets using highly multiplexed amplicon sequencing. HiMAP employs amplicon sequencing based on highly multiplexed polymerase chain reaction (PCR) to generate its datasets. It provides features for locus selection, primer design, target amplification, sequencing, and post-sequencing data processing and analysis. The method requires minimal hands-on time at the bench, and data can be processed rapidly for consensus calls that permits avoiding read mapping or assembly.

STAG-CNS / Suffix Tree Arbitrary Gene number: Conserved Noncoding Sequence

Characterizes loss of conserved noncoding sequences associated with retained duplicate genes from the ancient maize polyploidy. STAG-CNS integrates data from the promoters of conserved orthologous genes in three or more species simultaneously. It can also be used to identify differences in the loss or retention of conserved non-coding sequences between duplicated genes. It can be employs to identify even smaller regions of conserved sequence within gene promoters with acceptable false discovery proportions.

SPIDIR / Species Informed Distance-based Reconstruction

Reconstructs gene-tree by using learning gene- and species-specific substitution rates across multiple complete genomes. SPIDIR achieves significantly higher accuracy, addresses the species-level heterotachy and enables studies of gene evolution in the context of species evolution. It uses a generative model of gene-tree evolution to calculate the likelihood of a phylogeny. The tool can be viewed as specifying a prior probability on gene-tree branch lengths, which could replace the uniform branch length prior.

PANDIT / Protein and Associated Nucleotide Domains with Inferred Trees

Assists researchers to investigate molecular evolutionary phenomena and to develop phylogenetic and comparative genomic methodology. PANDIT provides a set of aligned protein coding sequences, both at the DNA and amino acid level. It offers estimation of the evolutionary trees describing the relationship between these sequences. The database allows users to quickly and efficiently produce datasets relevant for their needs.


Enables detection of all orthologs that share a given evolutionary context. CLfinder-OrthNet identifies co-linearity (CL) in the arrangement of orthologous loci among multiple genomes and builds networks of orthologs to encode and visualize all evolutionary events, such as gene duplication, deletion, and transposition, in each orthologs group. It detects multiple forms of gene-level structural variation, including tandem duplications, deletions, transpositions, and also combinations of them.

STRIDE / Species Tree Root Inference from gene Duplication Events

Identifies sets of well-supported gene duplication events from cohorts of gene trees. STRIDE is an algorithm developed to analyze these duplication events to infer a probability distribution over an unrooted species tree for the location of the true root. This package correctly identifies the community-accepted root of the majority of species trees and effectively captures uncertainty in root placement when data is limited or conflicting.

STS / Suffix Tree Searcher

Allows the analysis of large numbers of unaligned long DNA sequences through the application of disk-based partitioned suffix trees (based on MGB’s DiGeST). STS is designed as an easy-to-use tool to index, search, and analyze very large DNA sequence datasets. Accordingly, the program is accessed through a Java Web Start link on a web page, which automatically installs or updates the program files for the user. Results are presented in tabular form, can be sorted based on multiple criteria, and are easily integrated into subsequent queries with a mouse click, providing for a natural analysis workflow. In addition, since the initial construction of suffix trees is computationally expensive, STS allows the user to load previously constructed suffix trees for analysis. Thus, once the suffix tree forest has been constructed for a given data set, future analyses can be run much more quickly by skipping the most time-consuming step.


Computes the triplet and quartet distances between general rooted or unrooted trees, respectively. The tqDist program is based on algorithms with running time O(n log n) for the triplet distance calculation and O(d.n log n) for the quartet distance calculation, where n is the number of leaves in the trees and d is the degree of the tree with minimum degree. The software package is carefully implemented in C++, and interfaces for scripting in Python (via the module pyTQDist) and in R (via the package rtqdist) are also provided.


Offers a public knowledge-base for information on the tree-of-life and its evolutionary timescale. TimeTree is a web-based resource for researchers and educators from all fields of science to assist in the discovery of patterns and the formulation of hypotheses, or simply to learn about the timetree of life. The home page describes the database and presents a choice of two user-query forms: finding the divergence of two taxa, and finding all time estimates from a particular author. The current version contains timetrees reported from molecular clock analyses in 910 published studies and 17 341 species that span the diversity of life. TimeTree interprets complex and hierarchical data from these studies for each user query, which can be launched using an iPhone application, in addition to the website.


Provides evolutionary models for all genomic components of fully sequenced bacterial genomes describing the pattern of common ancestry observed among sequences. GeneTrees is a phylogenomics analysis platform that presents integrated visualization of phylogenetic models with their underlying sequence data. The database is structured as a list of taxa (species or within-species variants such as strains) using the GenBank taxonomy, the complete set of protein sequences annotated for these genomes and a set of homology models.

FGF / Fishing Gene Family

Integrates homologous search and evolutionary analysis. FGF is a web server that uses protein sequences as bait for fishing gene families and extracting related information. It permits to visualize the chromosomal position of the duplications, the exon–intron structure and construct a phylogenetic tree based on a distance matrix. It deciphers not only the members of a gene family, but also their evolutionary relationships and helps in deducing the fate of the duplications.

DED / Database of Evolutionary Distances

Contains sequence information from several vertebrate species clustered into homology groups. DED is an online resource for gene function assignment, molecular phylogenetic studies, search for lateral gene transfer, reconstruction of identification of biochemical pathways in poorly characterized organisms and sequence evolution patterns. It also includes multiple sequence alignments for both protein and nucleotide sequences along with the phylogenetic trees and graphical representation of sequence relationships within a homology group.


Allows evaluation of the confidence of phylogenetic tree selection. CONSEL provides features to read the matrix of the site-wise log-likelihoods of the candidate trees. It can be applied to parsimony, the minimum evolution, or the least squares. This tool reads the bootstrap replicates of the log-likelihoods generated in amount. It can produce several p-values for the candidate trees; the Bootstrap Probability (BP), the Kishino-Hasegawa (KH) test, the Shimodaira-Hasegawa (SH) test, the Weighted Shimodaira-Hasegawa (WSH) test, and the Approximately Unbiased (AU) test.


Allows exploration of a range of pre-defined phylo-grammars and provides tools for visualization of user’s phylo-grammars. xREI is a web server that provides a method for visualizing phylo-grammars developed within the xrate framework. The software handles xrate grammars and stockholm alignments with defined phylogenetic trees. It can produce state transition graphs and substitution rate matrices for the grammar, and also provide training and alignment annotation functions.