Removing technical variability in RNA-seq data using conditional quantile normalization
Indexed incrossrefpubmed
Abstract
The ability to measure gene expression on a genome-wide scale is one of the most promising accomplishments in molecular biology. Microarrays, the technology that first permitted this, were riddled with problems due to unwanted sources of variability. Many of these problems are now mitigated, after a decade's worth of statistical methodology development. The recently developed RNA sequencing (RNA-seq) technology has generated much excitement in part due to claims of reduced variability in comparison to microarrays. However, we show that RNA-seq data demonstrate unwanted and obscuring variability similar to what was first observed in microarrays. In particular, we find guanine-cytosine content (GC-content) has a…
Citation impact
667
total citations
- FWCI
- 16.26
- Percentile
- 100%
- References
- 38
Citations per year
Authors
3Topics & keywords
Topics
Keywords
- Normalization (sociology)
- Quantile
- DNA microarray
- Computer science
- False positive paradox
- RNA-Seq
- Quantile regression
- Data mining
No related works found for this paper.