Robust deep learning–based protein sequence design using ProteinMPNN
University of Washington · Howard Hughes Medical Institute · +2 more institutions
Abstract
Although deep learning has revolutionized protein structure prediction, almost all experimentally characterized de novo protein designs have been generated using physically based approaches such as Rosetta. Here, we describe a deep learning-based protein sequence design method, ProteinMPNN, that has outstanding performance in both in silico and experimental tests. On native protein backbones, ProteinMPNN has a sequence recovery of 52.4% compared with 32.9% for Rosetta. The amino acid sequence at different positions can be coupled between single or multiple chains, enabling application to a wide range of current protein design challenges. We demonstrate the broad utility and high accuracy of ProteinMPNN using…
Citation impact
- FWCI
- 123.96
- Percentile
- 100%
- References
- 35
Authors
22Topics & keywords
- Protein design
- In silico
- Deep learning
- Sequence (biology)
- Protein sequencing
- Computational biology
- Peptide sequence
- Computer science
Funding
- NSNational Science FoundationAwards: DGE-2140004, DBI 1937533, 1937533, DE-AC02-05CH11231, P30 GM124169, 2140004
- UDU.S. Department of EnergyAwards: -AC02-05CH11231, 05CH11231, AC02-05CH11231, DE-AC02, DE-AC02-05CH11231, GM124169, DE-AC02-
- APAlfred P. Sloan FoundationAwards: G-2021-16899, DE-AC02-05CH11231
- WRWashington Research Foundation
- OPOpen Philanthropy Project
- NINational Institutes of HealthAwards: GM124169, P30 GM124169-01, P30 GM124169, DE-AC02-05CH11231
- OOOffice of ScienceAwards: AC02-05CH11231, -AC02-05CH11231, DE-AC02
- NINational Institute of General Medical SciencesAwards: DE-AC02-05CH11231, P30 GM124169, P30 GM124169-01, GM124169-01, GM124169
- BEBasic Energy SciencesAwards: DE-AC02, AC02-05CH11231, DE-AC02-05CH11231, -AC02-05CH11231