What You See May Not Be What You Get: A Brief, Nontechnical Introduction to Overfitting in Regression-Type Models
Duke Medical Center · Duke University Hospital · +1 more institution
Abstract
Objective: Statistical models, such as linear or logistic regression or survival analysis, are frequently used as a means to answer scientific questions in psychosomatic research. Many who use these techniques, however, apparently fail to appreciate fully the problem of overfitting, ie, capitalizing on the idiosyncrasies of the sample at hand. Overfitted models will fail to replicate in future samples, thus creating considerable uncertainty about the scientific merit of the finding. The present article is a nontechnical discussion of the concept of overfitting and is intended to be accessible to readers with varying levels of statistical expertise. The notion of overfitting is presented in terms of asking too…
Citation impact
- FWCI
- 4.55
- Percentile
- 100%
- References
- 18
Authors
1Topics & keywords
- Overfitting
- Spurious relationship
- Computer science
- Set (abstract data type)
- Inference
- Artificial intelligence
- Machine learning
- Logistic regression
- Peace, Justice and strong institutions