NetVLAD: CNN Architecture for Weakly Supervised Place Recognition
Institut national de recherche en sciences et technologies du numérique · École Normale Supérieure · +3 more institutions
Abstract
We tackle the problem of large scale visual place recognition, where the task is to quickly and accurately recognize the location of a given query photograph. We present the following four principal contributions. First, we develop a convolutional neural network (CNN) architecture that is trainable in an end-to-end manner directly for the place recognition task. The main component of this architecture, NetVLAD, is a new generalized VLAD layer, inspired by the "Vector of Locally Aggregated Descriptors" image representation commonly used in image retrieval. The layer is readily pluggable into any CNN architecture and amenable to training via backpropagation. Second, we create a new weakly supervised ranking…
Citation impact
- FWCI
- 47.99
- Percentile
- 100%
- References
- 134
Authors
5- RARelja ArandjelovićCorresponding
Institut national de recherche en sciences et technologies du numérique, École Normale Supérieure
- PGPetr Gronát
Institut national de recherche en sciences et technologies du numérique, École Normale Supérieure
- ATAkihiko Torii
Tokyo Institute of Technology, Tokyo University of Technology
- TPTomáš Pajdla
Czech Technical University in Prague
- JŠJosef Šivic
Institut national de recherche en sciences et technologies du numérique, École Normale Supérieure
Topics & keywords
- Computer science
- Artificial intelligence
- Convolutional neural network
- Pattern recognition (psychology)
- Representation (politics)
- Architecture
- Layer (electronics)
- Ranking (information retrieval)
- Sustainable cities and communities