Rethinking Visual Geo-localization for Large-Scale Applications
Polytechnic University of Turin
Abstract
Visual Geo-localization (VG) is the task of estimating the position where a given photo was taken by comparing it with a large database of images of known locations. To investigate how existing techniques would perform on a real-world city-wide VG application, we build San Francisco eXtra Large, a new dataset covering a whole city and providing a wide range of challenging cases, with a size 30x bigger than the previous largest dataset for visual geo-localization. We find that current methods fail to scale to such large datasets, therefore we design a new highly scalable training technique, called CosPlace, which casts the training as a classification problem avoiding the expensive mining needed by the commonly…
Citation impact
- FWCI
- 12.80
- Percentile
- 100%
- References
- 63
Authors
3Topics & keywords
- Computer science
- Scalability
- Task (project management)
- Range (aeronautics)
- Artificial intelligence
- Scale (ratio)
- Domain (mathematical analysis)
- Code (set theory)
- Sustainable cities and communities