Complete Pipeline for Infinium ® Human Methylation 450K BeadChip Data Processing Using Subset Quantile Normalization for Accurate DNA Methylation Estimation
Commissariat à l'Énergie Atomique et aux Énergies Alternatives · Genoscope
Abstract
Huge progress has been made in the development of array- or sequencing-based technologies for DNA methylation analysis. The Illumina Infinium(®) Human Methylation 450K BeadChip (Illumina Inc., CA, USA) allows the simultaneous quantitative monitoring of more than 480,000 CpG positions, enabling large-scale epigenotyping studies. However, the assay combines two different assay chemistries, which may cause a bias in the analysis if all signals are merged as a unique source of methylation measurement. MATERIALS & METHODS: We confirm in three 450K data sets that Infinium I signals are more stable and cover a wider dynamic range of methylation values than Infinium II signals. We evaluated the methylation profile of Infinium I and II probes obtained with different normalization protocols and compared these results with the methylation values of a subset of CpGs analyzed by pyrosequencing.
We developed a subset quantile normalization approach for the processing of 450K BeadChips. The Infinium I signals were used as 'anchors' to normalize Infinium II signals at the level of probe coverage categories. Our normalization approach outperformed alternative normalization or correction approaches in terms of bias correction and methylation signal estimation. We further implemented a complete preprocessing protocol that solves most of the issues currently raised by 450K array users.
Citation impact
- FWCI
- 14.51
- Percentile
- 100%
- References
- 27
Authors
2Topics & keywords
- Biology
- DNA methylation
- Normalization (sociology)
- Methylation
- Genetics
- Computational biology
- Pipeline (software)
- DNA