Unsupervised evolution of protein and antibody complexes with a structure-informed language model
Stanford University · Chan Zuckerberg Initiative (United States)
Abstract
Large language models trained on sequence information alone can learn high-level principles of protein design. However, beyond sequence, the three-dimensional structures of proteins determine their specific function, activity, and evolvability. Here, we show that a general protein language model augmented with protein structure backbone coordinates can guide evolution for diverse proteins without the need to model individual functional tasks. We also demonstrate that ESM-IF1, which was only trained on single-chain structures, can be extended to engineer protein complexes. Using this approach, we screened about 30 variants of two therapeutic clinical antibodies used to treat severe acute respiratory syndrome…
Citation impact
- FWCI
- 25.38
- Percentile
- 100%
- References
- 79
Authors
4Topics & keywords
- Computational biology
- Evolvability
- Computer science
- Protein structure
- Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)
- Sequence (biology)
- Antibody
- Protein sequencing