baySeq: Empirical Bayesian methods for identifying differential expression in sequence count data
Abstract
High throughput sequencing has become an important technology for studying expression levels in many types of genomic, and particularly transcriptomic, data. One key way of analysing such data is to look for elements of the data which display particular patterns of differential expression in order to take these forward for further analysis and validation.
We propose a framework for defining patterns of differential expression and develop a novel algorithm, baySeq, which uses an empirical Bayes approach to detect these patterns of differential expression within a set of sequencing samples. The method assumes a negative binomial distribution for the data and derives an empirically determined prior distribution from the entire dataset. We examine the performance of the method on real and simulated data.
Citation impact
- FWCI
- 20.28
- Percentile
- 100%
- References
- 28
Authors
2Topics & keywords
- Computer science
- Pairwise comparison
- Bayes' theorem
- Negative binomial distribution
- Data mining
- Bayesian probability
- Count data
- Expression (computer science)