Rapid in silico directed evolution by a protein language model with EVOLVEpro
Brigham and Women's Hospital · Beth Israel Deaconess Medical Center · +6 more institutions
Abstract
Directed protein evolution is central to biomedical applications but faces challenges such as experimental complexity, inefficient multiproperty optimization, and local maxima traps. Although in silico methods that use protein language models (PLMs) can provide modeled fitness landscape guidance, they struggle to generalize across diverse protein families and map to protein activity. We present EVOLVEpro, a few-shot active learning framework that combines PLMs and regression models to rapidly improve protein activity. EVOLVEpro surpasses current methods, yielding up to 100-fold improvements in desired properties. We demonstrate its effectiveness across six proteins in RNA production, genome editing, and…
Citation impact
- FWCI
- 30.42
- Percentile
- 100%
- References
- 78
Authors
12- KJKaiyi JiangCorresponding
Brigham and Women's Hospital, Beth Israel Deaconess Medical Center, Mass General Brigham, Massachusetts Institute of Technology
- ZYZhaoqing YanCorresponding
Brigham and Women's Hospital, Beth Israel Deaconess Medical Center, Mass General Brigham
- MDMatteo Di BernardoCorresponding
Whitehead Institute for Biomedical Research
- SRSamantha R. Sgrizzi
Brigham and Women's Hospital, Beth Israel Deaconess Medical Center, Mass General Brigham
- LVLukas Villiger
Kantonsspital St. Gallen
Topics & keywords
- In silico
- Computational biology
- Computer science
- Synthetic biology
- Artificial intelligence
- Protein engineering
- Language model
- Machine learning