articleBMC GenomicsMay 1, 2013GOLD OA

A data-driven approach to preprocessing Illumina 450K methylation array data

King's College London · University of Exeter

PubMed
Indexed incrossrefdoajpubmed

Abstract

Background

As the most stable and experimentally accessible epigenetic mark, DNA methylation is of great interest to the research community. The landscape of DNA methylation across tissues, through development and in disease pathogenesis is not yet well characterized. Thus there is a need for rapid and cost effective methods for assessing genome-wide levels of DNA methylation. The Illumina Infinium HumanMethylation450 (450K) BeadChip is a very useful addition to the available methods for DNA methylation analysis but its complex design, incorporating two different assay methods, requires careful consideration. Accordingly, several normalization schemes have been published. We have taken advantage of known DNA methylation patterns associated with genomic imprinting and X-chromosome inactivation (XCI), in addition to the performance of SNP genotyping assays present on the array, to derive three independent metrics which we use to test alternative schemes of correction and normalization. These metrics also have potential utility as quality scores for datasets.

Results

The standard index of DNA methylation at any specific CpG site is β = M/(M + U + 100) where M and U are methylated and unmethylated signal intensities, respectively. Betas (βs) calculated from raw signal intensities (the default GenomeStudio behavior) perform well, but using 11 methylomic datasets we demonstrate that quantile normalization methods produce marked improvement, even in highly consistent data, by all three metrics. The commonly used procedure of normalizing betas is inferior to the separate normalization of M and U, and it is also advantageous to normalize Type I and Type II assays separately. More elaborate manipulation of quantiles proves to be counterproductive.

Citation impact

1,267
total citations
FWCI
26.32
Percentile
100%
References
24
Citations per year

Authors

6

Topics & keywords

Keywords
  • DNA methylation
  • Normalization (sociology)
  • Biology
  • Methylation
  • Computational biology
  • Epigenetics
  • CpG site
  • Genetics
No related works found for this paper.

Funding