NextPolish: a fast and efficient genome polishing tool for long-read assembly
Bioscience (China) · Grandomics (China)
Abstract
MOTIVATION: Although long-read sequencing technologies can produce genomes with long contiguity, they suffer from high error rates. Thus, we developed NextPolish, a tool that efficiently corrects sequence errors in genomes assembled with long reads. This new tool consists of two interlinked modules that are designed to score and count K-mers from high quality short reads, and to polish genome assemblies containing large numbers of base errors. RESULTS: When evaluated for the speed and efficiency using human and a plant (Arabidopsis thaliana) genomes, NextPolish outperformed Pilon by correcting sequence errors faster, and with a higher correction accuracy. AVAILABILITY AND IMPLEMENTATION: NextPolish is…
Citation impact
- FWCI
- 28.37
- Percentile
- 100%
- References
- 14
Authors
4Topics & keywords
- Python (programming language)
- Computer science
- Genome
- Sequence assembly
- k-mer
- Source code
- Software
- Contiguity