A sequence-based global map of regulatory activity for deciphering human genetics
Princeton University · Simons Foundation · +2 more institutions
Abstract
Epigenomic profiling has enabled large-scale identification of regulatory elements, yet we still lack a systematic mapping from any sequence or variant to regulatory activities. We address this challenge with Sei, a framework for integrating human genetics data with sequence information to discover the regulatory basis of traits and diseases. Sei learns a vocabulary of regulatory activities, called sequence classes, using a deep learning model that predicts 21,907 chromatin profiles across >1,300 cell lines and tissues. Sequence classes provide a global classification and quantification of sequence and variant effects based on diverse regulatory activities, such as cell type-specific enhancer functions. These…
Citation impact
- FWCI
- 20.79
- Percentile
- 100%
- References
- 44
Authors
4Topics & keywords
- Biology
- Regulatory sequence
- Computational biology
- Epigenomics
- Genetics
- Chromatin
- Sequence (biology)
- Enhancer
Funding
- NSNational Science Foundation
- UDU.S. Department of Health and Human ServicesAward: HHSN272201000054C
- SFSimons FoundationAward: 395506
- CPCancer Prevention and Research Institute of TexasAward: RR190071
- PUPrinceton University
- CICanadian Institute for Advanced Research
- UOUniversity of Texas Southwestern Medical Center
- FHFlatiron Health
- NINational Institutes of HealthAwards: R01HG005998, U54HL117798, R01GM071966, DP2GM146336