articleInternational Journal of EpidemiologyFeb 1, 2012BRONZE OA

Bump hunting to identify differentially methylated regions in epigenetic epidemiology studies

Johns Hopkins University

PubMed
Indexed incrossrefpubmed

Abstract

Background

During the past 5 years, high-throughput technologies have been successfully used by epidemiology studies, but almost all have focused on sequence variation through genome-wide association studies (GWAS). Today, the study of other genomic events is becoming more common in large-scale epidemiological studies. Many of these, unlike the single-nucleotide polymorphism studied in GWAS, are continuous measures. In this context, the exercise of searching for regions of interest for disease is akin to the problems described in the statistical 'bump hunting' literature.

Methods

New statistical challenges arise when the measurements are continuous rather than categorical, when they are measured with uncertainty, and when both biological signal, and measurement errors are characterized by spatial correlation along the genome. Perhaps the most challenging complication is that continuous genomic data from large studies are measured throughout long periods, making them susceptible to 'batch effects'. An example that combines all three characteristics is genome-wide DNA methylation measurements. Here, we present a data analysis pipeline that effectively models measurement error, removes batch effects, detects regions of interest and attaches statistical uncertainty to identified regions.

Citation impact

768
total citations
FWCI
24.94
Percentile
100%
References
47
Citations per year

Authors

7

Topics & keywords

Keywords
  • Epigenetics
  • Epidemiology
  • Epigenesis
  • Biology
  • Genetics
  • DNA methylation
  • Computational biology
  • Evolutionary biology
UN Sustainable Development Goals
  • Good health and well-being
No related works found for this paper.