The impact of class imbalance in classification performance metrics based on the binary confusion matrix
Abstract
A major issue in the classification of class imbalanced datasets involves the determination of the most suitable performance metrics to be used. In previous work using several examples, it has been shown that imbalance can exert a major impact on the value and meaning of accuracy and on certain other well-known performance metrics. In this paper, our approach goes beyond simply studying case studies and develops a systematic analysis of this impact by simulating the results obtained using binary classifiers. A set of functions and numerical indicators are attained which enables the comparison of the behaviour of several performance metrics based on the binary confusion matrix when they are faced with…
Citation impact
- FWCI
- 67.46
- Percentile
- 100%
- References
- 53
Authors
4- ALAmalia Luque SendraCorresponding
Universidad de Sevilla
- ACAlejandro Carrasco
Universidad de Sevilla
- AMAlejandro Martín
Universidad de Sevilla
- ADAna de las Heras
Universidad de Sevilla
Topics & keywords
- Confusion matrix
- Class (philosophy)
- Confusion
- Binary number
- Matrix (chemical analysis)
- Computer science
- Artificial intelligence
- Pattern recognition (psychology)