FLASH: fast length adjustment of short reads to improve genome assemblies
Johns Hopkins University · Johns Hopkins Medicine
Abstract
MOTIVATION: Next-generation sequencing technologies generate very large numbers of short reads. Even with very deep genome coverage, short read lengths cause problems in de novo assemblies. The use of paired-end libraries with a fragment size shorter than twice the read length provides an opportunity to generate much longer reads by overlapping and merging read pairs before assembling a genome. RESULTS: We present FLASH, a fast computational tool to extend the length of short reads by overlapping paired-end reads from fragment libraries that are sufficiently short. We tested the correctness of the tool on one million simulated read pairs, and we then applied it as a pre-processor for genome assemblies of…
Citation impact
- FWCI
- 17.16
- Percentile
- 100%
- References
- 10
Authors
2Topics & keywords
- Computer science
- Contig
- Flash (photography)
- Genome
- Software
- Sequence assembly
- k-mer
- Hybrid genome assembly