Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation

Koren, Sergey; Walenz, Brian P.; Berlin, Konstantin; Miller, Jason; Bergman, Nicholas H.; Phillippy, Adam M.

doi:10.1101/gr.215087.116

Article · Genome Research · Mar 15, 2017 · Bronze OA

National Institutes of Health · National Human Genome Research Institute · +3 more institutions

Indexed in: Crossref, PubMed

Abstract

Long-read single-molecule sequencing has revolutionized de novo genome assembly and enabled the automated reconstruction of reference-quality genomes. However, given the relatively high error rates of such technologies, efficient and accurate assembly of large repeats and closely related haplotypes remains challenging. We address these issues with Canu, a successor of Celera Assembler that is specifically designed for noisy single-molecule sequences. Canu introduces support for nanopore sequencing, halves depth-of-coverage requirements, and improves assembly continuity while simultaneously reducing runtime by an order of magnitude on large genomes versus Celera Assembler 8.2. These advances result from new…

Citation impact

Total citations: 8,127

References: 73

Authors: 6

Topics & keywords

Keywords

Biology
Weighting
Separation (statistics)
Computational biology
Genetics
Computer science
Machine learning
Physics

UN Sustainable Development Goals

Life below water
