reviewACM Computing SurveysAug 30, 2019Closed access

A Systematic Review on Imbalanced Data Challenges in Machine Learning

Thapar Institute of Engineering & Technology

Indexed incrossref

Abstract

In machine learning, the data imbalance imposes challenges to perform data analytics in almost all areas of real-world research. The raw primary data often suffers from the skewed perspective of data distribution of one class over the other as in the case of computer vision, information security, marketing, and medical science. The goal of this article is to present a comparative analysis of the approaches from the reference of data pre-processing, algorithmic and hybrid paradigms for contemporary imbalance data analysis techniques, and their comparative study in lieu of different data distribution and their application areas.

Citation impact

484
total citations
FWCI
23.65
Percentile
100%
References
137
Citations per year

Authors

3

Topics & keywords

Keywords
  • Computer science
  • Raw data
  • Perspective (graphical)
  • Data science
  • Machine learning
  • Data analysis
  • Class (philosophy)
  • Artificial intelligence
No related works found for this paper.