articleTechnologiesFeb 20, 2025GOLD OA

Comprehensive Analysis of Random Forest and XGBoost Performance with SMOTE, ADASYN, and GNUS Under Varying Imbalance Levels

Stockholm University · University of Georgia

Indexed incrossrefdoaj

Abstract

This study examines the efficacy of Random Forest and XGBoost classifiers in conjunction with three upsampling techniques—SMOTE, ADASYN, and Gaussian noise upsampling (GNUS)—across datasets with varying class imbalance levels, ranging from moderate to extreme (15% to 1% churn rate). Employing metrics such as F1 score, ROC AUC, PR AUC, Matthews Correlation Coefficient (MCC), and Cohen’s Kappa, this research provides a comprehensive evaluation of classifier performance under different imbalance scenarios, focusing on applications in the telecommunications domain. The findings highlight that tuned XGBoost paired with SMOTE (Tuned_XGB_SMOTE) consistently achieves the highest F1 score and robust performance across…

Citation impact

150
total citations
FWCI
285.10
Percentile
100%
References
82
Citations per year

Authors

3

Topics & keywords

Keywords
  • Random forest
  • Computer science
  • Forestry
  • Statistics
  • Artificial intelligence
  • Mathematics
  • Agroforestry
  • Environmental science
No related works found for this paper.