rnaSPAdes: a de novo transcriptome assembler and its application to RNA-Seq data
Abstract
The possibility of generating large RNA-sequencing datasets has led to development of various reference-based and de novo transcriptome assemblers with their own strengths and limitations. While reference-based tools are widely used in various transcriptomic studies, their application is limited to the organisms with finished and well-annotated genomes. De novo transcriptome reconstruction from short reads remains an open challenging problem, which is complicated by the varying expression levels across different genes, alternative splicing, and paralogous genes.
Herein we describe the novel transcriptome assembler rnaSPAdes, which has been developed on top of the SPAdes genome assembler and explores computational parallels between assembly of transcriptomes and single-cell genomes. We also present quality assessment reports for rnaSPAdes assemblies, compare it with modern transcriptome assembly tools using several evaluation approaches on various RNA-sequencing datasets, and briefly highlight strong and weak points of different assemblers.
Citation impact
- FWCI
- 28.97
- Percentile
- 100%
- References
- 43
Authors
4Topics & keywords
- Transcriptome
- Sequence assembly
- De novo transcriptome assembly
- Computational biology
- Genome
- RNA-Seq
- Computer science
- Biology