Core Vector Machines: Fast SVM Training on Very Large Data Sets
Hong Kong University of Science and Technology
Abstract
Standard SVM training has O(m3) time and O(m2) space complexities, where m is the training set size. It is thus computationally infeasible on very large data sets. By observing that practical SVM implementations only approximate the optimal solution by an iterative strategy, we scale up kernel methods by exploiting such "approximateness" in this paper. We first show that many kernel methods can be equivalently formulated as minimum enclosing ball (MEB) problems in computational geometry. Then, by adopting an efficient approximate MEB algorithm, we obtain provably approximately optimal solutions with the idea of core sets. Our proposed Core Vector Machine (CVM) algorithm can be used with nonlinear kernels and…
Citation impact
- FWCI
- 43.88
- Percentile
- 100%
- References
- 54
Authors
3Topics & keywords
- Pentium
- Support vector machine
- Computer science
- Kernel (algebra)
- Kernel method
- Algorithm
- Training set
- Gaussian function