Robust enzyme discovery and engineering with deep learning using CataPro
Shandong University · Viva Biotech (China) · +5 more institutions
Abstract
Abstract Accurate prediction of enzyme kinetic parameters is crucial for enzyme exploration and modification. Existing models face the problem of either low accuracy or poor generalization ability due to overfitting. In this work, we first developed unbiased datasets to evaluate the actual performance of these methods and proposed a deep learning model, CataPro, based on pre-trained models and molecular fingerprints to predict turnover number ( k c a t ), Michaelis constant ( K m ), and catalytic efficiency ( k c a t / K m ). Compared with previous baseline models, CataPro demonstrates clearly enhanced accuracy and generalization ability on the unbiased datasets. In a representational enzyme mining project, by…
Citation impact
- FWCI
- 81.34
- Percentile
- 100%
- References
- 98
Authors
9Topics & keywords
- Overfitting
- Generalization
- Computer science
- Artificial intelligence
- Machine learning
- Baseline (sea)
- Deep learning
- Mathematics