reviewJournal Of Big DataMar 23, 2025GOLD OA

Resampling approaches to handle class imbalance: a review from a data perspective

University of Aveiro

Indexed incrossrefdoaj

Abstract

This article presents a data-driven review of resampling approaches aimed at mitigating the class imbalance problem in machine learning, a widespread issue that limits classifier performance across numerous sectors. Initially, this research provides an extensive theoretical examination of the class imbalance problem, emphasizing its propensity to amplify existing data difficulty factors, including class overlap, small disjuncts, and noise, thus biasing the model towards the majority class. Acknowledging the significance of detecting and quantifying the synergistic effects between class imbalance and these data difficulty factors, this study surveys metrics formulated to quantify such phenomena in imbalanced…

Citation impact

75
total citations
FWCI
140.46
Percentile
100%
References
129
Citations per year

Authors

3

Topics & keywords

Keywords
  • Computer science
  • Perspective (graphical)
  • Computational Science and Engineering
  • Resampling
  • Class (philosophy)
  • Data science
  • Data mining
  • Machine learning
UN Sustainable Development Goals
  • Decent work and economic growth
No related works found for this paper.

Funding