HiCanu: accurate assembly of segmental duplications, satellites, and allelic variants from high-fidelity long reads
National Institutes of Health · National Human Genome Research Institute · +4 more institutions
Abstract
Complete and accurate genome assemblies form the basis of most downstream genomic analyses and are of critical importance. Recent genome assembly projects have relied on a combination of noisy long-read sequencing and accurate short-read sequencing, with the former offering greater assembly continuity and the latter providing higher consensus accuracy. The recently introduced Pacific Biosciences (PacBio) HiFi sequencing technology bridges this divide by delivering long reads (>10 kbp) with high per-base accuracy (>99.9%). Here we present HiCanu, a modification of the Canu assembler designed to leverage the full potential of HiFi reads via homopolymer compression, overlap-based error correction, and aggressive…
Citation impact
- FWCI
- —
- Percentile
- —
- References
- 83
Authors
10- SNSergey NurkCorresponding
National Institutes of Health, National Human Genome Research Institute
- BPBrian P. Walenz
National Institutes of Health, National Human Genome Research Institute
- ARArang Rhie
National Institutes of Health, National Human Genome Research Institute
- MRMitchell R. Vollger
University of Washington
- GAGlennis A. Logsdon
University of Washington
Topics & keywords
- Biology
- Contig
- Segmental duplication
- Sequence assembly
- Genome
- Nanopore sequencing
- Human genome
- Computational biology
- Life below water