What is the best multi-stage architecture for object recognition?

Jarrett, Kevin; Kavukcuoglu, Koray; Ranzato, M.; LeCun, Yann

doi:10.1109/iccv.2009.5459469

articleSep 1, 2009Closed access

What is the best multi-stage architecture for object recognition?

KJKevin Jarrett KKKoray Kavukcuoglu MRM. Ranzato YLYann LeCun

Courant Institute of Mathematical Sciences · New York University

Indexed incrossref

Abstract

In many recent object recognition systems, feature extraction stages are generally composed of a filter bank, a non-linear transformation, and some sort of feature pooling layer. Most systems use only one stage of feature extraction in which the filters are hard-wired, or two stages where the filters in one or both stages are learned in supervised or unsupervised mode. This paper addresses three questions: 1. How does the non-linearities that follow the filter banks influence the recognition accuracy? 2. does learning the filter banks in an unsupervised or supervised manner improve the performance over random filters or hardwired filters? 3. Is there any advantage to using an architecture with two stages of…

Citation impact

2,171

total citations

FWCI: 52.52
Percentile: 100%
References: 65

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Artificial intelligence
Computer science
Pattern recognition (psychology)
Pooling
MNIST database
Feature extraction
Normalization (sociology)
Filter (signal processing)

No related works found for this paper.