Article · Jun 16, 2024 · Closed access
Optimal Transport Aggregation for Visual Place Recognition
Indexed in Crossref
Abstract
The task of Visual Place Recognition (VPR) aims to match a query image against references from an extensive database of images from different places, relying solely on visual cues. State-of-the-art pipelines focus on the aggregation of features extracted from a deep backbone, in order to form a global descriptor for each image. In this context, we introduce SALAD (Sinkhorn Algorithm for Locally Aggregated Descriptors), which reformulates NetVLAD's soft-assignment of local features to clusters as an optimal transport problem. In SALAD, we consider both feature-to-cluster and cluster-to-feature relations and we also introduce a ‘dustbin’ cluster, designed to selectively discard features deemed non-informative,…
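The idea of treating feature-to-cluster assignment as optimal transport with an extra 'dustbin' column can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function name `sinkhorn_assign`, the constant `dustbin_score`, the uniform column marginals, and the final weighted-sum aggregation are all assumptions made for illustration, since the abstract above does not specify them.

```python
import numpy as np


def _logsumexp(x, axis):
    """Numerically stable log(sum(exp(x))) along one axis."""
    m = x.max(axis=axis, keepdims=True)
    return (m + np.log(np.exp(x - m).sum(axis=axis, keepdims=True))).squeeze(axis)


def sinkhorn_assign(scores, dustbin_score=0.0, n_iters=100):
    """Entropic optimal-transport assignment of n features to m clusters.

    scores: (n, m) array of feature-to-cluster affinities.
    Returns an (n, m + 1) non-negative matrix; the last column is the
    hypothetical dustbin that absorbs mass from uninformative features.
    """
    n, m = scores.shape
    # Append a constant-score dustbin column (assumed; it could be learned).
    log_k = np.concatenate([scores, np.full((n, 1), dustbin_score)], axis=1)

    # Assumed marginals: each feature carries unit mass, and the total
    # mass n is spread uniformly over the m + 1 columns.
    log_a = np.zeros(n)
    log_b = np.full(m + 1, np.log(n / (m + 1)))

    u = np.zeros(n)
    v = np.zeros(m + 1)
    for _ in range(n_iters):
        # Alternate row and column normalisation in the log domain, so
        # both feature-to-cluster and cluster-to-feature sums are enforced.
        u = log_a - _logsumexp(log_k + v[None, :], axis=1)
        v = log_b - _logsumexp(log_k + u[:, None], axis=0)

    return np.exp(log_k + u[:, None] + v[None, :])


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    features = rng.normal(size=(6, 8))   # 6 local features, 8 dimensions
    centers = rng.normal(size=(3, 8))    # 3 hypothetical cluster centers
    plan = sinkhorn_assign(features @ centers.T)
    # Drop the dustbin column, then aggregate features per cluster.
    descriptor = plan[:, :-1].T @ features
    print(descriptor.shape)
```

Working in the log domain avoids overflow when scores are large. Dropping the last column before aggregation is what lets mass sent to the dustbin be discarded rather than contribute to the descriptor.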
Citation impact
- Total citations: 111
- FWCI: 148.55
- Percentile: 100%
- References: 72
Authors: 2
Topics & keywords
Topics
Keywords
- Computer science
- Artificial intelligence
- Computer vision
- Human–computer interaction