A Theoretical Analysis of Feature Pooling in Visual Recognition
Centre National de la Recherche Scientifique · Institut national de recherche en sciences et technologies du numérique · +3 more institutions
Abstract
Many modern visual recognition algorithms incorporate a step of spatial ‘pooling’, where the outputs of several nearby feature detectors are combined into a local or global ‘bag of features’, in a way that preserves task-related information while removing irrelevant details. Pooling is used to achieve invariance to image transformations, more compact representations, and better robustness to noise and clutter. Several papers have shown that the details of the pooling operation can greatly influence the performance, but studies have so far been purely empirical. In this paper, we show that the reasons underlying the performance of various pooling methods are obscured by several confounding factors, such as the…
Citation impact
- FWCI
- 28.45
- Percentile
- 100%
- References
- 20
Authors
3- YBY-Lan BoureauCorresponding
Centre National de la Recherche Scientifique, Institut national de recherche en sciences et technologies du numérique, Courant Institute of Mathematical Sciences, New York University
- JPJean Ponce
Centre National de la Recherche Scientifique, Institut national de recherche en sciences et technologies du numérique, École Normale Supérieure - PSL
- YLYann LeCun
Courant Institute of Mathematical Sciences, New York University
Topics & keywords
- Pooling
- Computer science
- Artificial intelligence
- Robustness (evolution)
- Clutter
- Pattern recognition (psychology)
- Cognitive neuroscience of visual object recognition
- Feature (linguistics)