articleJun 21, 2010Closed access

A Theoretical Analysis of Feature Pooling in Visual Recognition

Centre National de la Recherche Scientifique · Institut national de recherche en sciences et technologies du numérique · +3 more institutions

Abstract

Many modern visual recognition algorithms incorporate a step of spatial ‘pooling’, where the outputs of several nearby feature detectors are combined into a local or global ‘bag of features’, in a way that preserves task-related information while removing irrelevant details. Pooling is used to achieve invariance to image transformations, more compact representations, and better robustness to noise and clutter. Several papers have shown that the details of the pooling operation can greatly influence the performance, but studies have so far been purely empirical. In this paper, we show that the reasons underlying the performance of various pooling methods are obscured by several confounding factors, such as the…

No related works found for this paper.