Handling imbalanced medical datasets: review of a decade of research
National Higher School of Statistics and Applied Economy · University of Córdoba · +2 more institutions
Abstract
Abstract Machine learning and medical diagnostic studies often struggle with the issue of class imbalance in medical datasets, complicating accurate disease prediction and undermining diagnostic tools. Despite ongoing research efforts, specific characteristics of medical data frequently remain overlooked. This article comprehensively reviews advances in addressing imbalanced medical datasets over the past decade, offering a novel classification of approaches into preprocessing, learning levels, and combined techniques. We present a detailed evaluation of the medical datasets and metrics used, synthesizing the outcomes of previous research to reflect on the effectiveness of the methodologies despite…
Citation impact
- FWCI
- 63.18
- Percentile
- 100%
- References
- 133
Authors
5Topics & keywords
- Computer science
- Data science
- Machine learning
- Artificial intelligence
- Data mining