Direct prediction of intrinsically disordered protein conformational properties from sequence
Washington University in St. Louis
Abstract
Intrinsically disordered regions (IDRs) are ubiquitous across all domains of life and play a range of functional roles. While folded domains are generally well described by a stable three-dimensional structure, IDRs exist in a collection of interconverting states known as an ensemble. This structural heterogeneity means that IDRs are largely absent from the Protein Data Bank, contributing to a lack of computational approaches to predict ensemble conformational properties from sequence. Here we combine rational sequence design, large-scale molecular simulations and deep learning to develop ALBATROSS, a deep-learning model for predicting ensemble dimensions of IDRs, including the radius of gyration, end-to-end…
Citation impact
- FWCI
- 43.73
- Percentile
- 100%
- References
- 111
Authors
5Topics & keywords
- Intrinsically disordered proteins
- Sequence (biology)
- Computer science
- Radius of gyration
- Leverage (statistics)
- Ensemble learning
- Scaling
- Computational biology