articleNature BiotechnologyJul 27, 2015HYBRID OA

Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning

University of Toronto · Canadian Institute for Advanced Research

PubMed
Indexed incrossrefpubmed

Abstract

Knowing the sequence specificities of DNA- and RNA-binding proteins is essential for developing models of the regulatory processes in biological systems and for identifying causal disease variants. Here we show that sequence specificities can be ascertained from experimental data with 'deep learning' techniques, which offer a scalable, flexible and unified computational approach for pattern discovery. Using a diverse array of experimental data and evaluation metrics, we find that deep learning outperforms other state-of-the-art methods, even when training on in vitro data and testing on in vivo data. We call this approach DeepBind and have built a stand-alone software tool that is fully automatic and handles…

Citation impact

3,148
total citations
FWCI
95.31
Percentile
100%
References
49
Citations per year

Authors

4

Topics & keywords

Keywords
  • Computational biology
  • Sequence (biology)
  • Deep learning
  • Computer science
  • Artificial intelligence
  • DNA sequencing
  • Scalability
  • RNA
No related works found for this paper.

Funding