Linear spatial pyramid matching using sparse coding for image classification
University of Illinois Urbana-Champaign · NEC (United States)
Abstract
Recently SVMs using spatial pyramid matching (SPM) kernel have been highly successful in image classification. Despite its popularity, these nonlinear SVMs have a complexity O(n 2 ∼ n 3 ) in training and O(n) in testing, where n is the training size, implying that it is nontrivial to scaleup the algorithms to handlemore than thousands of training images. In this paper we develop an extension of the SPM method, by generalizing vector quantization to sparse coding followed by multi-scale spatial max pooling, and propose a linear SPM kernel based on SIFT sparse codes. This new approach remarkably reduces the complexity of SVMs to O(n) in training and a constant in testing. In a number of image categorization…
Citation impact
- FWCI
- 49.27
- Percentile
- 100%
- References
- 37
Authors
4- JYJianchao YangCorresponding
University of Illinois Urbana-Champaign
- KYKai Yu
NEC (United States)
- YGYihong Gong
NEC (United States)
- TSThomas S. Huang
University of Illinois Urbana-Champaign
Topics & keywords
- Artificial intelligence
- Pattern recognition (psychology)
- Pyramid (geometry)
- Support vector machine
- Computer science
- Pooling
- Kernel (algebra)
- Scale-invariant feature transform
- Quality Education