The complete sequence of a human genome
National Institutes of Health · National Human Genome Research Institute · +43 more institutions
Abstract
Since its initial release in 2000, the human reference genome has covered only the euchromatic fraction of the genome, leaving important heterochromatic regions unfinished. Addressing the remaining 8% of the genome, the Telomere-to-Telomere (T2T) Consortium presents a complete 3.055 billion-base pair sequence of a human genome, T2T-CHM13, that includes gapless assemblies for all chromosomes except Y, corrects errors in the prior references, and introduces nearly 200 million base pairs of sequence containing 1956 gene predictions, 99 of which are predicted to be protein coding. The completed regions include all centromeric satellite arrays, recent segmental duplications, and the short arms of all five…
Citation impact
- FWCI
- 667.36
- Percentile
- 100%
- References
- 128
Authors
100- SNSergey NurkCorresponding
National Institutes of Health, National Human Genome Research Institute
- SKSergey KorenCorresponding
National Institutes of Health, National Human Genome Research Institute
- ARArang RhieCorresponding
National Institutes of Health, National Human Genome Research Institute
- MRMikko RautiainenCorresponding
National Institutes of Health, National Human Genome Research Institute
- AVAndrey V. Bzikadze
University of California San Diego
Topics & keywords
- Euchromatin
- Genome
- Human genome
- Telomere
- Genetics
- Biology
- Centromere
- Genome project
Funding
- NSNational Science FoundationAwards: 1732253, 1627442, 1643825, 1613806, 1758800
- HHHoward Hughes Medical Institute
- DRDamon Runyon Cancer Research Foundation
- CIConnecticut Innovations
- SIStowers Institute for Medical Research
- UOUniversity of Pittsburgh
- AWAmazon Web Services
- ONOxford Nanopore Technologies
- SNSchweizerischer Nationalfonds zur Förderung der Wissenschaftlichen ForschungAward: 30039
- MMax-Planck-Gesellschaft
- SPSaint Petersburg State University
- RSRussian Science FoundationAwards: 19-75-30039, 075-10-2020-116
- NINational Institutes of HealthAwards: R24DK106766, U01HG010971, R01HG006677, U41HG007234, R01HG011274, R01HG009190, UM1HG008898, R01HG010040, F32GM134558, R01HG010485, R35GM133747, U24HG006620, R01HG010169, U24HG010263, U01CA253481, U41HG010972, R01HG002385, U01HG010961, R01GM123312, U24HG009081
- NINational Institute of Standards and Technology
- NHNational Human Genome Research InstituteAwards: U01HG010971, R01HG009190, R01HG011274, R01HG006677, R01HG010485, U24HG006620, U41HG010972, R01HG002385, U01HG010961, F31HG011205, R01HG010040, U41HG007234, UM1HG008898, U24HG010263
- NONIH Office of the Director
- UNU.S. National Library of Medicine
- CFCommon Fund