Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable Analysis
Indexed incrossrefdoajpubmed
Abstract
It has unambiguously been shown that genetic, environmental, demographic, and technical factors may have substantial effects on gene expression levels. In addition to the measured variable(s) of interest, there will tend to be sources of signal due to factors that are unknown, unmeasured, or too complicated to capture through simple models. We show that failing to incorporate these sources of heterogeneity into an analysis can have widespread and detrimental effects on the study. Not only can this reduce power or induce unwanted dependence across genes, but it can also introduce sources of spurious signal to many genes. This phenomenon is true even for well-designed, randomized studies. We introduce "surrogate…
Citation impact
2,156
total citations
- FWCI
- 8.88
- Percentile
- 100%
- References
- 46
Citations per year
Authors
2Topics & keywords
Topics
Keywords
- Spurious relationship
- Biology
- Computational biology
- Variable (mathematics)
- Expression (computer science)
- Gene
- Gene expression
- Genetics
No related works found for this paper.