articleNucleic Acids ResearchOct 7, 2014GOLD OA

svaseq: removing batch effects and other unwanted noise from sequencing data

Johns Hopkins University

PubMed
Indexed incrossrefdoajpubmed

Abstract

It is now known that unwanted noise and unmodeled artifacts such as batch effects can dramatically reduce the accuracy of statistical inference in genomic experiments. These sources of noise must be modeled and removed to accurately measure biological variability and to obtain correct statistical inference when performing high-throughput genomic analysis. We introduced surrogate variable analysis (sva) for estimating these artifacts by (i) identifying the part of the genomic data only affected by artifacts and (ii) estimating the artifacts with principal components or singular vectors of the subset of the data matrix. The resulting estimates of artifacts can be used in subsequent analyses as adjustment factors…

Citation impact

584
total citations
FWCI
11.38
Percentile
100%
References
45
Citations per year

Authors

1

Topics & keywords

Keywords
  • Bioconductor
  • Inference
  • Noise (video)
  • Computer science
  • Statistical inference
  • Transformation (genetics)
  • Data mining
  • Biology
No related works found for this paper.

Funding