The sva package for removing batch effects and other unwanted variation in high-throughput experiments
Boston University · Johns Hopkins University · +1 more institution
Abstract
Abstract Summary: Heterogeneity and latent variables are now widely recognized as major sources of bias and variability in high-throughput experiments. The most well-known source of latent variation in genomic experiments are batch effects—when samples are processed on different days, in different groups or by different people. However, there are also a large number of other variables that may have a major impact on high-throughput measurements. Here we describe the sva package for identifying, estimating and removing unwanted sources of variation in high-throughput experiments. The sva package supports surrogate variable estimation with the sva function, direct adjustment for known batch effects with the…
Citation impact
- FWCI
- 18.54
- Percentile
- 100%
- References
- 12
Authors
5- JTJeffrey T. LeekCorresponding
Boston University, Johns Hopkins University, Princeton University
- WEW. Evan Johnson
Boston University, Johns Hopkins University, Princeton University
- HSHilary S. Parker
Boston University, Johns Hopkins University, Princeton University
- AEAndrew E. Jaffe
Boston University, Johns Hopkins University, Princeton University
- JDJohn D. Storey
Boston University, Johns Hopkins University, Princeton University
Topics & keywords
- Throughput
- Latent variable
- Computer science
- Function (biology)
- Variation (astronomy)
- Variable (mathematics)
- R package
- Statistics