Review | IEEE Access | Jan 1, 2025 | Gold OA

Imbalanced Data Problem in Machine Learning: A Review

King Khalid University

Indexed in: Crossref, DOAJ

Abstract

One of the prominent challenges in real-world data is class imbalance, an unequal distribution of observations across target classes that makes accurate classification harder to achieve. This survey examines machine learning techniques developed to address the difficulties posed by imbalanced data. It discusses data-level methods such as oversampling and undersampling, algorithm-level solutions including ensemble learning and algorithm-specific adjustments, cost-sensitive algorithms, and hybrid strategies that combine multiple approaches. Moreover, this paper emphasizes the crucial role of evaluation methods like Precision, F1 Score, Recall, G-mean, and AUC in…
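
As an illustration of why the abstract singles out these evaluation measures, the following minimal sketch computes Precision, Recall, F1 Score, and G-mean from binary confusion-matrix counts and compares them with plain accuracy. The function name `imbalance_metrics` and the counts are hypothetical and do not come from the paper; G-mean is taken here as the geometric mean of recall and specificity, which is its usual binary-class definition. AUC is left out because it needs ranked prediction scores rather than counts.

```python
import math


def imbalance_metrics(tp, fp, fn, tn):
    """Return common metrics from binary confusion-matrix counts.

    The positive class is assumed to be the minority class.
    Ratios with a zero denominator are reported as 0.0.
    """
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0        # true positive rate
    specificity = tn / (tn + fp) if (tn + fp) else 0.0   # true negative rate
    pr_sum = precision + recall
    f1 = 2 * precision * recall / pr_sum if pr_sum else 0.0
    g_mean = math.sqrt(recall * specificity)
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "g_mean": g_mean,
    }


# Hypothetical data: 1000 samples, of which only 50 are positive.
# The classifier finds 10 of the positives and raises 5 false alarms.
metrics = imbalance_metrics(tp=10, fp=5, fn=40, tn=945)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

With these made-up counts, accuracy stays above 0.95 even though recall on the minority class is only 0.2. F1 and G-mean both drop well below accuracy, which is the behavior that makes them more informative on imbalanced data.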

Citation impact

Total citations: 144
FWCI: 273.69
Percentile: 100%
References: 55

Authors

3

Topics & keywords

Keywords
  • Computer science
  • Artificial intelligence
  • Machine learning

Funding