articleComputationMay 21, 2025GOLD OA

Assessing the Validity of k-Fold Cross-Validation for Model Selection: Evidence from Bankruptcy Prediction Using Random Forest and XGBoost

Bucharest University of Economic Studies

Indexed incrossrefdoaj

Abstract

Predicting corporate bankruptcy is a key task in financial risk management, and selecting a machine learning model with superior generalization performance is crucial for prediction accuracy. This study evaluates the effectiveness of k-fold cross-validation as a model selection strategy for random forest and XGBoost classifiers using a publicly available dataset of Taiwanese listed companies. We employ a nested cross-validation framework to assess the relationship between cross-validation (CV) and out-of-sample (OOS) performance on 40 different train/test data partitions. On average, we find k-fold cross-validation to be a valid selection technique when applied within a model class; however, k-fold…

Citation impact

42
total citations
FWCI
128.85
Percentile
100%
References
25
Citations per year

Authors

2

Topics & keywords

Keywords
  • Cross-validation
  • Fold (higher-order function)
  • Random forest
  • Model selection
  • Selection (genetic algorithm)
  • Statistics
  • Model validation
  • Econometrics
No related works found for this paper.