Rapid traversal of vast chemical space using machine learning-guided docking screens
Broad Institute · Uppsala University · +7 more institutions
Abstract
The accelerating growth of make-on-demand chemical libraries provides unprecedented opportunities to identify starting points for drug discovery with virtual screening. However, these multi-billion-scale libraries are challenging to screen, even for the fastest structure-based docking methods. Here we explore a strategy that combines machine learning and molecular docking to enable rapid virtual screening of databases containing billions of compounds. In our workflow, a classification algorithm is trained to identify top-scoring compounds based on molecular docking of 1 million compounds to the target protein. The conformal prediction framework is then used to make selections from the multi-billion-scale…
Citation impact
- FWCI
- 58.94
- Percentile
- 100%
- References
- 56
Authors
11- ALAndreas LuttensCorresponding
Broad Institute, Uppsala University, Science for Life Laboratory, Massachusetts Institute of Technology
- ICIsrael Cabeza de Vaca
Uppsala University, Science for Life Laboratory
- LSLeonard Sparring
Uppsala University, Science for Life Laboratory
- JBJosé Brea
Universidade de Santiago de Compostela, Center for Research in Molecular Medicine and Chronic Diseases, Instituto de Investigación Sanitaria de Santiago
- ALAntón L. Martínez
Universidade de Santiago de Compostela, Center for Research in Molecular Medicine and Chronic Diseases, Instituto de Investigación Sanitaria de Santiago
Topics & keywords
- Virtual screening
- Workflow
- Computer science
- Docking (animal)
- Drug discovery
- Chemical space
- Tree traversal
- Classifier (UML)
Funding
- ELEli Lilly and Company
- ECEuropean CommissionAward: 715052
- CCancerfonden
- LULinköpings UniversitetAward: 2022-06725
- KOKnut och Alice Wallenbergs Stiftelse
- VVetenskapsrådetAward: 2022-06725
- UUUppsala UniversitetAwards: AI4Research, 2022-06725
- XDXunta de GaliciaAwards: ED431C, ED431C 2022/20
- EREuropean Regional Development FundAward: ED431C
- AEAgencia Estatal de InvestigaciónAward: PID2020-119428RB-I00
- NSNational Supercomputer Centre, Linköpings UniversitetAward: 2022-06725