Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly
National Institutes of Health · National Center for Biotechnology Information · +8 more institutions
Abstract
The human reference genome assembly plays a central role in nearly all aspects of today's basic and clinical research. GRCh38 is the first coordinate-changing assembly update since 2009; it reflects the resolution of roughly 1000 issues and encompasses modifications ranging from thousands of single base changes to megabase-scale path reorganizations, gap closures, and localization of previously orphaned sequences. We developed a new approach to sequence generation for targeted base updates and used data from new genome mapping technologies and single haplotype resources to identify and resolve larger assembly issues. For the first time, the reference assembly contains sequence-based representations for the…
Citation impact
- FWCI
- —
- Percentile
- —
- References
- 79
Authors
38- VSValérie SchneiderCorresponding
National Institutes of Health, National Center for Biotechnology Information
- TATina A. Graves-Lindsay
James S. McDonnell Foundation
- KHKerstin Howe
Wellcome Sanger Institute
- NBNathan Bouk
National Institutes of Health, National Center for Biotechnology Information
- HCHsiu-Chuan Chen
National Institutes of Health, National Center for Biotechnology Information
Topics & keywords
- Biology
- Computational biology
- Reference genome
- Annotation
- Sequence assembly
- Human genome
- Genome
- Population
Funding
- HHHoward Hughes Medical Institute
- WWellcomeAwards: WT098051, WT095908, WT104947/Z/14/Z
- WTWellcome TrustAwards: WT095908, WT104947/Z/14/Z, WT098051
- EMEuropean Molecular Biology Laboratory
- NINational Institutes of HealthAwards: 5U54HG003079, HG002385, 5U41HG007635, HG007635
- NHNational Human Genome Research InstituteAward: 5U54HG003079
- UNU.S. National Library of Medicine