A theoretical distribution analysis of synthetic minority oversampling technique (SMOTE) for imbalanced learning
Cairo University · Canadian University of Dubai
Abstract
Abstract Class imbalance occurs when the class distribution is not equal. Namely, one class is under-represented (minority class), and the other class has significantly more samples in the data (majority class). The class imbalance problem is prevalent in many real world applications. Generally, the under-represented minority class is the class of interest. The synthetic minority over-sampling technique (SMOTE) method is considered the most prominent method for handling unbalanced data. The SMOTE method generates new synthetic data patterns by performing linear interpolation between minority class samples and their K nearest neighbors. However, the SMOTE generated patterns do not necessarily conform to the…
Citation impact
- FWCI
- 52.52
- Percentile
- 100%
- References
- 64
Authors
3Topics & keywords
- Oversampling
- Class (philosophy)
- Interpolation (computer graphics)
- Mathematics
- Distribution (mathematics)
- Artificial intelligence
- Algorithm
- Computer science
- Reduced inequalities