Manipulation of FASTQ data with Galaxy
Pennsylvania State University · Howard Hughes Medical Institute · +2 more institutions
Abstract
SUMMARY: Here, we describe a tool suite that functions on all of the commonly known FASTQ format variants and provides a pipeline for manipulating next generation sequencing data taken from a sequencing machine all the way through the quality filtering steps. AVAILABILITY AND IMPLEMENTATION: This open-source toolset was implemented in Python and has been integrated into the online data analysis platform Galaxy (public web access: http://usegalaxy.org; download: http://getgalaxy.org). Two short movies that highlight the functionality of tools described in this manuscript as well as results from testing components of this tool suite against a set of previously published files are available at…
Citation impact
- FWCI
- 10.21
- Percentile
- 100%
- References
- 4
Authors
7- DBDaniel Blankenberg
Pennsylvania State University, Howard Hughes Medical Institute, Emory University, Cold Spring Harbor Laboratory
- AGAssaf Gordon
Pennsylvania State University, Howard Hughes Medical Institute, Emory University, Cold Spring Harbor Laboratory
- GVGregory Von Kuster
Pennsylvania State University, Howard Hughes Medical Institute, Emory University, Cold Spring Harbor Laboratory
- NCNathan Coraor
Pennsylvania State University, Howard Hughes Medical Institute, Emory University, Cold Spring Harbor Laboratory
- JTJames TaylorCorresponding
Pennsylvania State University, Howard Hughes Medical Institute, Emory University, Cold Spring Harbor Laboratory
Topics & keywords
- Suite
- Python (programming language)
- Computer science
- Pipeline (software)
- Open source
- Information retrieval
- Set (abstract data type)
- World Wide Web