Integrating Hi-C links with assembly graphs for chromosome-scale assembly
National Human Genome Research Institute · University of Maryland, College Park · +1 more institution
Abstract
Long-read sequencing and novel long-range assays have revolutionized de novo genome assembly by automating the reconstruction of reference-quality genomes. In particular, Hi-C sequencing is becoming an economical method for generating chromosome-scale scaffolds. Despite its increasing popularity, there are limited open-source tools available. Errors, particularly inversions and fusions across chromosomes, remain higher than alternate scaffolding technologies. We present a novel open-source Hi-C scaffolder that does not require an a priori estimate of chromosome number and minimizes errors by scaffolding with the assistance of an assembly graph. We demonstrate higher accuracy than the state-of-the-art methods…
Citation impact
- FWCI
- 27.35
- Percentile
- 100%
- References
- 54
Authors
8Topics & keywords
- Python (programming language)
- Computer science
- Sequence assembly
- Open source
- Chromosome
- Computational biology
- Genome
- Source code