Contrastive learning in protein language space predicts interactions between drugs and protein targets
Massachusetts Institute of Technology · Ragon Institute of MGH, MIT and Harvard · +1 more institution
Abstract
Sequence-based prediction of drug–target interactions has the potential to accelerate drug discovery by complementing experimental screens. Such computational prediction needs to be generalizable and scalable while remaining sensitive to subtle variations in the inputs. However, current computational techniques fail to simultaneously meet these goals, often sacrificing performance of one to achieve the others. We develop a deep learning model, ConPLex, successfully leveraging the advances in pretrained protein language models (“PLex”) and employing a protein-anchored contrastive coembedding (“Con”) to outperform state-of-the-art approaches. ConPLex achieves high accuracy, broad adaptivity to unseen data, and…
Citation impact
- FWCI
- 35.73
- Percentile
- 100%
- References
- 76
Authors
5Topics & keywords
- Computational biology
- Space (punctuation)
- Computer science
- Biology