A complete reference genome improves analysis of human genetic variation
Johns Hopkins University · University of California, Davis · +13 more institutions
Abstract
Compared to its predecessors, the Telomere-to-Telomere CHM13 genome adds nearly 200 million base pairs of sequence, corrects thousands of structural errors, and unlocks the most complex regions of the human genome for clinical and functional study. We show how this reference universally improves read mapping and variant calling for 3202 and 17 globally diverse samples sequenced with short and long reads, respectively. We identify hundreds of thousands of variants per sample in previously unresolved regions, showcasing the promise of the T2T-CHM13 reference for evolutionary and biomedical discovery. Simultaneously, this reference eliminates tens of thousands of spurious variants per sample, including reduction…
Citation impact
- FWCI
- 38.11
- Percentile
- 100%
- References
- 106
Authors
33Topics & keywords
- Reference genome
- Human genome
- False positive paradox
- Genome
- Biology
- Computational biology
- Structural variation
- Genetics