SVMs Modeling for Highly Imbalanced Classification
Georgia State University · University of Notre Dame
Abstract
Traditional classification algorithms can be limited in their performance on highly unbalanced data sets. A popular stream of work for countering the problem of class imbalance has been the application of a sundry of sampling strategies. In this paper, we focus on designing modifications to support vector machines (SVMs) to appropriately tackle the problem of class imbalance. We incorporate different "rebalance" heuristics in SVM modeling, including cost-sensitive learning, and over- and undersampling. These SVM-based strategies are compared with various state-of-the-art approaches on a variety of data sets by using various metrics, including G-mean, area under the receiver operating characteristic curve,…
Citation impact
- FWCI
- 35.25
- Percentile
- 100%
- References
- 39
Authors
4Topics & keywords
- Undersampling
- Support vector machine
- Computer science
- Artificial intelligence
- Machine learning
- Class (philosophy)
- Set (abstract data type)
- Heuristics