Core Vector Machines: Fast SVM Training on Very Large Data Sets

Tsang, Ivor W.; Kwok, James T.; Cheung, Pak-Ming

doi:10.5555/1046920.1058114

articleDec 1, 2005Closed access

Core Vector Machines: Fast SVM Training on Very Large Data Sets

IWIvor W. Tsang JTJames T. Kwok PCPak-Ming Cheung

Hong Kong University of Science and Technology

Abstract

Standard SVM training has O(m3) time and O(m2) space complexities, where m is the training set size. It is thus computationally infeasible on very large data sets. By observing that practical SVM implementations only approximate the optimal solution by an iterative strategy, we scale up kernel methods by exploiting such "approximateness" in this paper. We first show that many kernel methods can be equivalently formulated as minimum enclosing ball (MEB) problems in computational geometry. Then, by adopting an efficient approximate MEB algorithm, we obtain provably approximately optimal solutions with the idea of core sets. Our proposed Core Vector Machine (CVM) algorithm can be used with nonlinear kernels and…

Citation impact

951

total citations

FWCI: 43.88
Percentile: 100%
References: 54

Citations per year

Authors

3

Topics & keywords

Topics

Keywords

Pentium
Support vector machine
Computer science
Kernel (algebra)
Kernel method
Algorithm
Training set
Gaussian function

No related works found for this paper.