Normalization software tools | DNA methylation microarray data analysis
Normalisation concerns the removal of sources of experimental artefacts, random noise and technical and systematic variation caused by microarray technology, which, if left unaddressed, has the potential to mask true biological differences. Two different types of normalisation exist: (1) between-array normalisation, removing technical artefacts between samples on different arrays, and (2) within-array normalisation, correcting for intensity-related dye biases.
An R package for comprehensive analysis of DNA methylation data obtained with any experimental protocol that provides single-CpG resolution, including Infinium 450K microarray and bisulfite sequencing protocols, but also MeDIP-seq and MBD-seq.
A suite of computational tools that incorporate state-of-the-art statistical techniques for the analysis of DNAm data. minfi provides methods for preprocessing, quality assessment and detection of differentially methylated regions from the kilobase to the megabase scale. Several preprocessing algorithms are available and the infrastructure provides a convenient way for developers to easily implement their techniques as Bioconductor tools. By making SNP annotation available, users can choose to be cautious about probes that may behave unexpectedly due to the inclusion of a SNP in the probe sequence. minfi is unique in that it provides both bump hunting and block finding capabilities, and the assessment of statistical significance for the identified regions. Finally, because the package is implemented in Bioconductor, it gives users access to the countless analysis and visualization tools available in R.
Performs normalisation, removal of unwanted variation in differential methylation analysis, differential variability testing and gene set analysis for the 450K array. The functions have been written to complement the limma package and are compatible with data objects from minfi, methylumi and edgeR. Detects differential variability for individual CpG sites in methylation data. DiffVar is available as a function in the missMethyl Bioconductor R package, and depends on the limma framework.
Provides class infrastructure and associated methods to construct an Illumina analysis workflow pipeline starting with raw data through functional analysis. Besides supporting the existing algorithms for microarray data, the lumi package includes several unique parts: (i) a variance-stabilizing transformation that utilizes the technical replicates available on the Illumina microarray; (ii) normalization algorithms designed for Illumina microarray data and; iii) the nucleotide universal identifier annotation packages.
Allows Illumina HumanMethylation BeadChip analysis. ChAMP is an integrated analysis pipeline including functions for (i) filtering low quality probes, adjustment for Infinium I and Infinium II probe design, (ii) batch effect correction, detecting differentially methylated positions (DMPs), (iii) finding differentially methylated regions (DMRs) and (iv) detection of copy number aberrations. The software also allows detection of differentially methylated genomic blocks (DMB) and Gene Set Enrichment Analysis (GSEA).
A within array normalization method that substantially reduces the technical variability between the probe types whilst maintaining the important biological differences. The SWAN method makes the assumption that the number of CpGs within the 50 bp probe sequence reflects the underlying biology of the region being interrogated. Hence, the overall distribution of intensities of probes with the same number of CpGs in the probe body should be the same. The method then uses a subset quantile normalization approach to adjust the intensities of the probes on the arrays.
A package that provides classes for holding and manipulating Illumina methylation data. Based on eSet, it can contain MIAME information, sample information, feature information, and multiple matrices of data. An ``intelligent'' import function, methylumiR can read the Illumina text files and create a MethyLumiSet. methylumIDAT can directly read raw IDAT files from HumanMethylation27 and HumanMethylation450 microarrays. Normalization, background correction, and quality control features for GoldenGate, Infinium, and Infinium HD arrays are also included.
Serves for processing Infinium DNA methylation data. SeSAMe consists of light and modular DNA methylation data analysis tool that can support the analysis of thousands of samples. It performs by masking deleted and hyperpolymorphic regions, decreasing or removing spurious reports of epigenetic silencing at oft-deleted tumor suppressor genes. This software reduces also technical variation while retaining biological variation.
An R package for normalization of data from the Illumina Infinium Human Methylation450 BeadChip (Illumina 450 K) built on the concepts in the recently published funNorm method, and introducing cell-type or tissue-type flexibility. funtooNorm is relevant for data sets containing samples from two or more cell or tissue types. A visual display of cross-validated errors informs the choice of the optimal number of components in the normalization. Improved normalization of datasets containing multiple tissues can be expected to translate into increased power to detect associations of interest, due to the inferred reduction in residual error; funNorm and this extension funtooNorm are designed with this goal in mind.
Proceeds quality control, normalizes and performs epigenome-wide association studies (EWAS). Meffil permits to minimize computational memory requirements to 5% of that required by other R packages excluding increasing running time. It is based on a reimplementation of functional normalization method. The tool normalizes datasets distributed across physically different locations, avoiding sharing any biologically-based individual-level data. In addition, it can reduce heterogeneity in meta-analyses of EWAS.
Allows to analyze and visualize differential methylation in genomic regions. ADMIRE is a semi-automatic pipeline that features five different normalization methods and performs two one-sided two-sample rank tests (Mann–Whitney U tests). The software features arbitrary experimental settings, quality control, automatic filtering, normalization, multiple testing, differential analyses on arbitrary genomic regions. It additionally implements a gene set enrichment procedure.
Allows researchers to perform preprocessing and analysis of microarray data via a graphical interface. eUTOPIA permits users to adjust parameters and observe the effect via graphical representations at each step, enabling them to make informed decisions. This method is capable of processing data from four microarray platforms: Agilent gene expression two color microarray data, Agilent gene expression one-color microarray data, Affymetrix gene expression microarray data, and Illumina methylation microarray data.
Aims to assist users in exploiting microarrays data. caCORRECT aims to improve downstream analysis, particularly in biomarker selection and translational bioinformatics. The application merges methods of normalization and multi-chip variance calculations based on technical or biological replicates to generate crisp representations of artifacts. It allows to both identify and remove errors in microarray experiments as well as describe experimental datasets and chips according to quality scores.