Aggregating Local Image Descriptors into Compact Codes
Institut national de recherche en informatique et en automatique · Xerox (France) · +3 more institutions
Abstract
This paper addresses the problem of large-scale image search. Three constraints have to be taken into account: search accuracy, efficiency, and memory usage. We first present and evaluate different ways of aggregating local image descriptors into a vector and show that the Fisher kernel achieves better performance than the reference bag-of-visual words approach for any given vector dimension. We then jointly optimize dimensionality reduction and indexing in order to obtain a precise vector comparison as well as a compact representation. The evaluation shows that the image representation can be reduced to a few dozen bytes while preserving high accuracy. Searching a 100 million image data set takes about 250 ms…
Citation impact
- FWCI
- 56.51
- Percentile
- 100%
- References
- 44
Authors
6- HJH. JegouCorresponding
Institut national de recherche en informatique et en automatique
- FPFlorent Perronnin
Xerox (France)
- MDMatthijs Douze
Institut national de recherche en informatique et en automatique
- JSJorge Sánchez
Universidad Nacional de Córdoba
- PPPatrick Pérez
Technicolor (Germany), Technicolor (France)
Topics & keywords
- Fisher kernel
- Search engine indexing
- Kernel (algebra)
- Computer science
- Artificial intelligence
- Dimensionality reduction
- Pattern recognition (psychology)
- Representation (politics)