SARTools: A DESeq2- and EdgeR-Based R Pipeline for Comprehensive Differential Analysis of RNA-Seq Data
Centre National de la Recherche Scientifique · Institut Pasteur · +2 more institutions
Abstract
Several R packages exist for the detection of differentially expressed genes from RNA-Seq data. The analysis process includes three main steps, namely normalization, dispersion estimation and test for differential expression. Quality control steps along this process are recommended but not mandatory, and failing to check the characteristics of the dataset may lead to spurious results. In addition, normalization methods and statistical models are not exchangeable across the packages without adequate transformations the users are often not aware of. Thus, dedicated analysis pipelines are needed to include systematic quality control steps and prevent errors from misusing the proposed methods.
SARTools is an R pipeline for differential analysis of RNA-Seq count data. It can handle designs involving two or more conditions of a single biological factor with or without a blocking factor (such as a batch effect or a sample pairing). It is based on DESeq2 and edgeR and is composed of an R package and two R script templates (for DESeq2 and edgeR respectively). Tuning a small number of parameters and executing one of the R scripts, users have access to the full results of the analysis, including lists of differentially expressed genes and a HTML report that (i) displays diagnostic plots for quality control and model hypotheses checking and (ii) keeps track of the whole analysis process, parameter values and versions of the R packages used.
Citation impact
- FWCI
- 23.15
- Percentile
- 100%
- References
- 23
Authors
4- HVHugo Varet
Centre National de la Recherche Scientifique, Institut Pasteur, Centre de Recherche en Informatique
- LBLoraine Brillet-Guéguen
Centre National de la Recherche Scientifique, Station Biologique de Roscoff
- JCJean‐Yves Coppée
Institut Pasteur
- MDMarie‐Agnès DilliesCorresponding
Institut Pasteur, Centre National de la Recherche Scientifique, Centre de Recherche en Informatique
Topics & keywords
- Pipeline (software)
- Computational biology
- Physics
- Biology
- Computer science
- Programming language