The MaSuRCA genome assembler
University of Maryland, Baltimore · Johns Hopkins University · +1 more institution
Abstract
MOTIVATION: Second-generation sequencing technologies produce high coverage of the genome by short reads at a low cost, which has prompted development of new assembly methods. In particular, multiple algorithms based on de Bruijn graphs have been shown to be effective for the assembly problem. In this article, we describe a new hybrid approach that has the computational efficiency of de Bruijn graph methods and the flexibility of overlap-based assembly strategies, and which allows variable read lengths while tolerating a significant level of sequencing error. Our method transforms large numbers of paired-end reads into a much smaller number of longer 'super-reads'. The use of super-reads allows us to assemble…
Citation impact
- FWCI
- 24.98
- Percentile
- 100%
- References
- 33
Authors
6Topics & keywords
- De Bruijn graph
- De Bruijn sequence
- Computer science
- Sequence assembly
- File Transfer Protocol
- Sanger sequencing
- Genome
- Reference genome