Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies
National Institutes of Health · National Human Genome Research Institute
Abstract
Recent long-read assemblies often exceed the quality and completeness of available reference genomes, making validation challenging. Here we present Merqury, a novel tool for reference-free assembly evaluation based on efficient k-mer set operations. By comparing k-mers in a de novo assembly to those found in unassembled high-accuracy reads, Merqury estimates base-level accuracy and completeness. For trios, Merqury can also evaluate haplotype-specific accuracy, completeness, phase block continuity, and switch errors. Multiple visualizations, such as k-mer spectrum plots, can be generated for evaluation. We demonstrate on both human and plant genomes that Merqury is a fast and robust method for assembly…
Citation impact
- FWCI
- 73.60
- Percentile
- 100%
- References
- 53
Authors
4- ARArang RhieCorresponding
National Institutes of Health, National Human Genome Research Institute
- BPBrian P. Walenz
National Institutes of Health, National Human Genome Research Institute
- SKSergey Koren
National Institutes of Health, National Human Genome Research Institute
- AMAdam M. Phillippy
National Institutes of Health, National Human Genome Research Institute
Topics & keywords
- Completeness (order theory)
- Phaser
- Genome
- k-mer
- Biology
- Sequence assembly
- Computer science
- Set (abstract data type)