articleJournal of Chemical Information and ModelingJun 22, 2015Closed access

Beware of R 2 : Simple, Unambiguous Assessment of the Prediction Accuracy of QSAR and QSPR Models

CSIRO Manufacturing · University of North Carolina at Chapel Hill · +1 more institution

PubMed
Indexed incrossrefpubmed

Abstract

The statistical metrics used to characterize the external predictivity of a model, i.e., how well it predicts the properties of an independent test set, have proliferated over the past decade. This paper clarifies some apparent confusion over the use of the coefficient of determination, R(2), as a measure of model fit and predictive power in QSAR and QSPR modeling. R(2) (or r(2)) has been used in various contexts in the literature in conjunction with training and test data for both ordinary linear regression and regression through the origin as well as with linear and nonlinear regression models. We analyze the widely adopted model fit criteria suggested by Golbraikh and Tropsha ( J. Mol. Graphics Modell. 2002…

Citation impact

729
total citations
FWCI
40.70
Percentile
100%
References
20
Citations per year

Authors

3

Topics & keywords

Keywords
  • Quantitative structure–activity relationship
  • Linear regression
  • Computer science
  • Statistic
  • Linear model
  • Test set
  • Set (abstract data type)
  • Coefficient of determination
UN Sustainable Development Goals
  • Peace, Justice and strong institutions
No related works found for this paper.