Predicting RNA-seq coverage from DNA sequence as a unifying model of gene regulation
Enzo Life Sciences (United States) · Sanofi (United States) · +1 more institution
Abstract
Sequence-based machine-learning models trained on genomics data improve genetic variant interpretation by providing functional predictions describing their impact on the cis-regulatory code. However, current tools do not predict RNA-seq expression profiles because of modeling challenges. Here, we introduce Borzoi, a model that learns to predict cell-type-specific and tissue-specific RNA-seq coverage from DNA sequence. Using statistics derived from Borzoi's predicted coverage, we isolate and accurately score DNA variant effects across multiple layers of regulation, including transcription, splicing and polyadenylation. Evaluated on quantitative trait loci, Borzoi is competitive with and often outperforms…
Citation impact
- FWCI
- 92.93
- Percentile
- 100%
- References
- 124
Authors
5Topics & keywords
- Biology
- Sequence (biology)
- Computational biology
- Genetics
- Gene
- DNA sequencing
- DNA
- RNA
Funding
- NINational Institutes of Health
- NHNational Heart, Lung, and Blood Institute
- NINational Institute of Mental Health
- NINational Institute on Drug Abuse
- NHNational Human Genome Research Institute
- NONIH Office of the Director
- NCNational Cancer Institute
- NINational Institute of Neurological Disorders and Stroke
- CFCommon Fund