Prediction error estimation: a comparison of resampling methods
Division of Cancer Epidemiology and Genetics · Zimmer Biomet (United States) · +1 more institution
Abstract
MOTIVATION: In genomic studies, thousands of features are collected on relatively few samples. One of the goals of these studies is to build classifiers to predict the outcome of future observations. There are three inherent steps to this process: feature selection, model selection and prediction assessment. With a focus on prediction assessment, we compare several methods for estimating the 'true' prediction error of a prediction model in the presence of feature selection. RESULTS: For small studies where features are selected from thousands of candidates, the resubstitution and simple split-sample estimates are seriously biased. In these small samples, leave-one-out cross-validation (LOOCV), 10-fold…
Citation impact
- FWCI
- 14.91
- Percentile
- 100%
- References
- 30
Authors
3Topics & keywords
- Resampling
- Estimation
- Computer science
- Jackknife resampling
- Statistics
- Artificial intelligence
- Mathematics
- Estimator
- Reduced inequalities