NetVLAD: CNN architecture for weakly supervised place recognition
Centre National de la Recherche Scientifique · Institut national de recherche en sciences et technologies du numérique · +4 more institutions
Abstract
We tackle the problem of large scale visual place recognition, where the task is to quickly and accurately recognize the location of a given query photograph. We present the following three principal contributions. First, we develop a convolutional neural network (CNN) architecture that is trainable in an end-to-end manner directly for the place recognition task. The main component of this architecture, NetVLAD, is a new generalized VLAD layer, inspired by the "Vector of Locally Aggregated Descriptors" image representation commonly used in image retrieval. The layer is readily pluggable into any CNN architecture and amenable to training via backpropagation. Second, we develop a training procedure, based on a…
Citation impact
- FWCI
- 43.98
- Percentile
- 100%
- References
- 126
Authors
5- ARArandjelovi\'c, ReljaCorresponding
Centre National de la Recherche Scientifique, Institut national de recherche en sciences et technologies du numérique, École Normale Supérieure - PSL, Département d'Informatique
- PGPetr Gronát
Centre National de la Recherche Scientifique, Institut national de recherche en sciences et technologies du numérique, École Normale Supérieure - PSL, Département d'Informatique
- ATAkihiko Torii
Tokyo Institute of Technology
- TPTomáš Pajdla
Czech Technical University in Prague
- JŠJosef Šivic
Centre National de la Recherche Scientifique, Institut national de recherche en sciences et technologies du numérique, École Normale Supérieure - PSL, Département d'Informatique, Czech Technical University in Prague
Topics & keywords
- Computer science
- Artificial intelligence
- Convolutional neural network
- Pattern recognition (psychology)
- Representation (politics)
- Architecture
- Layer (electronics)
- Ranking (information retrieval)
- Sustainable cities and communities