Exploratory Undersampling for Class-Imbalance Learning

Nanjing University · Georgia Institute of Technology

PubMed
Indexed incrossrefpubmed

Abstract

Undersampling is a popular method in dealing with class-imbalance problems, which uses only a subset of the majority class and thus is very efficient. The main deficiency is that many majority class examples are ignored. We propose two algorithms to overcome this deficiency. EasyEnsemble samples several subsets from the majority class, trains a learner using each of them, and combines the outputs of those learners. BalanceCascade trains the learners sequentially, where in each step, the majority class examples that are correctly classified by the current trained learners are removed from further consideration. Experimental results show that both methods have higher Area Under the ROC Curve, F-measure, and…

Citation impact

2,467
total citations
FWCI
63.46
Percentile
100%
References
66
Citations per year

Authors

3

Topics & keywords

Keywords
  • Undersampling
  • Class (philosophy)
  • Computer science
  • Artificial intelligence
  • Train
  • Machine learning
  • Mathematics
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.