What is the best multi-stage architecture for object recognition?
Courant Institute of Mathematical Sciences · New York University
Abstract
In many recent object recognition systems, feature extraction stages are generally composed of a filter bank, a non-linear transformation, and some sort of feature pooling layer. Most systems use only one stage of feature extraction in which the filters are hard-wired, or two stages where the filters in one or both stages are learned in supervised or unsupervised mode. This paper addresses three questions: 1. How does the non-linearities that follow the filter banks influence the recognition accuracy? 2. does learning the filter banks in an unsupervised or supervised manner improve the performance over random filters or hardwired filters? 3. Is there any advantage to using an architecture with two stages of…
Citation impact
- FWCI
- 52.52
- Percentile
- 100%
- References
- 65
Authors
4- KJKevin JarrettCorresponding
Courant Institute of Mathematical Sciences, New York University
- KKKoray Kavukcuoglu
Courant Institute of Mathematical Sciences, New York University
- MRM. Ranzato
Courant Institute of Mathematical Sciences, New York University
- YLYann LeCun
New York University, Courant Institute of Mathematical Sciences
Topics & keywords
- Artificial intelligence
- Computer science
- Pattern recognition (psychology)
- Pooling
- MNIST database
- Feature extraction
- Normalization (sociology)
- Filter (signal processing)