Imbalanced Data Problem in Machine Learning: A Review
Indexed incrossrefdoaj
Abstract
One of the prominent challenges encountered in real-world data is an imbalance, characterized by unequal distribution of observations across different target classes, which complicates achieving accurate model classifications. This survey delves into various machine learning techniques developed to address the difficulties posed by imbalanced data. It discusses data-level methods such as oversampling and undersampling, algorithm-level solutions including ensemble learning and specific algorithm adjustments, cost-sensitive algorithms, and hybrid strategies that combine multiple approaches. Moreover, this paper emphasizes the crucial role of evaluation methods like Precision, F1 Score, Recall, G-mean, and AUC in…
Citation impact
144
total citations
- FWCI
- 273.69
- Percentile
- 100%
- References
- 55
Citations per year
Authors
3Topics & keywords
Keywords
- Computer science
- Artificial intelligence
- Machine learning
No related works found for this paper.