ConPADE / Contig Ploidy and Allele Dosage Estimation
A probabilistic method that estimates the ploidy of any given contig or scaffold from its allele proportions. ConPADE performs well when sequencing coverage is sufficient or the true contig ploidy is low. The method can be applied to whole genome shotgun (WGS) sequencing data. It may also be used for related applications, such as identifying duplicated genes in fragmented assemblies, although refinements are needed.
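The idea of inferring ploidy from allele proportions can be illustrated with a small sketch. This is not ConPADE's actual model: it assumes a plain binomial read-count likelihood, a uniform prior over heterozygous dosages, and a fixed sequencing error rate, and the names `site_log_likelihood` and `best_ploidy` are hypothetical.

```python
from math import comb, log

def site_log_likelihood(alt_reads, depth, ploidy, error=0.01):
    """Log-likelihood of one variable site under a candidate ploidy (>= 2).

    Assumption: the alternate allele has dosage k in 1..ploidy-1 with a
    uniform prior, and read counts are binomial with success rate k/ploidy,
    adjusted for a symmetric sequencing error rate.
    """
    dosages = range(1, ploidy)
    total = 0.0
    for k in dosages:
        p = k / ploidy
        p = p * (1 - error) + (1 - p) * error  # fold in sequencing error
        total += comb(depth, alt_reads) * p**alt_reads * (1 - p)**(depth - alt_reads)
    return log(total / len(dosages))

def best_ploidy(sites, candidates=(2, 3, 4)):
    """Return the candidate ploidy with the highest summed log-likelihood.

    `sites` is a list of (alt_reads, depth) pairs for one contig.
    """
    scores = {
        m: sum(site_log_likelihood(a, d, m) for a, d in sites)
        for m in candidates
    }
    return max(scores, key=scores.get), scores

# Sites whose alternate-allele proportions cluster near 1/3 and 2/3.
sites = [(20, 60), (40, 60), (21, 60), (39, 60)]
ploidy, scores = best_ploidy(sites)
print(ploidy)
```

With these example counts the sketch selects a ploidy of 3. At low depth the binomial distributions for neighbouring dosages overlap heavily, which is consistent with the note above that sufficient coverage or low ploidy is needed.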
nQuire aims to distinguish the distributions of base frequencies at variable sites for diploids, triploids and tetraploids directly from read mappings to a reference genome. It is a statistical approach that models base frequencies as a Gaussian Mixture Model (GMM) and uses maximum likelihood to assess the empirical data under the assumption of each ploidy level. The method could be useful for assessing intraspecific variation in ploidy in both historic and modern samples, as well as in evolution experiments.
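A minimal sketch of comparing fixed Gaussian mixtures by likelihood is given here for illustration. It is not nQuire's implementation: it assumes equal mixture weights, a shared fixed standard deviation of 0.05, and component means at the expected base frequencies for each ploidy level. The names `PLOIDY_MEANS`, `gmm_log_likelihood` and `most_likely_ploidy` are hypothetical.

```python
from math import exp, log, pi, sqrt

# Assumed component means: expected base frequencies at variable sites.
PLOIDY_MEANS = {
    "diploid": [0.5],
    "triploid": [1 / 3, 2 / 3],
    "tetraploid": [0.25, 0.5, 0.75],
}

def normal_pdf(x, mean, sd):
    """Density of a normal distribution at x."""
    return exp(-0.5 * ((x - mean) / sd) ** 2) / (sd * sqrt(2 * pi))

def gmm_log_likelihood(freqs, means, sd=0.05):
    """Log-likelihood of observed base frequencies under a fixed mixture.

    Assumption: equal weights and one shared standard deviation; a real
    tool would estimate these from the data.
    """
    weight = 1.0 / len(means)
    return sum(
        log(sum(weight * normal_pdf(f, m, sd) for m in means))
        for f in freqs
    )

def most_likely_ploidy(freqs, sd=0.05):
    """Return the ploidy model with the highest log-likelihood."""
    scores = {
        name: gmm_log_likelihood(freqs, means, sd)
        for name, means in PLOIDY_MEANS.items()
    }
    return max(scores, key=scores.get), scores

# Base frequencies clustered around 1/3 and 2/3.
freqs = [0.32, 0.35, 0.66, 0.68, 0.34, 0.65]
model, scores = most_likely_ploidy(freqs)
print(model)
```

For these example frequencies the triploid mixture has the highest likelihood. Fixing the means and comparing likelihoods keeps the sketch short; fitting weights and variances would require an iterative procedure such as expectation-maximization.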
