Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions
Novartis Institutes for BioMedical Research · Novartis (Switzerland)
Abstract
A method to estimate ease of synthesis (synthetic accessibility) of drug-like molecules is needed in many areas of the drug discovery process. The development and validation of such a method that is able to characterize molecule synthetic accessibility as a score between 1 (easy to make) and 10 (very difficult to make) is described in this article.
The method for estimation of the synthetic accessibility score (SAscore) described here is based on a combination of fragment contributions and a complexity penalty. Fragment contributions have been calculated based on the analysis of one million representative molecules from PubChem and therefore one can say that they capture historical synthetic knowledge stored in this database. The molecular complexity score takes into account the presence of non-standard structural features, such as large rings, non-standard ring fusions, stereocomplexity and molecule size. The method has been validated by comparing calculated SAscores with ease of synthesis as estimated by experienced medicinal chemists for a set of 40 molecules. The agreement between calculated and manually estimated synthetic accessibility is very good with r2 = 0.89.
Citation impact
- FWCI
- 7.09
- Percentile
- 100%
- References
- 16
Authors
2Topics & keywords
- PubChem
- Drug discovery
- Computer science
- Fragment (logic)
- Set (abstract data type)
- Data mining
- Synthetic data
- Algorithm