RoMa: Robust Dense Feature Matching
East China University of Science and Technology · Chalmers University of Technology
Abstract
Feature matching is an important computer vision task that involves estimating correspondences between two images of a 3D scene, and dense methods estimate all such correspondences. The aim is to learn a robust model, i.e., a model able to match under challenging real-world changes. In this work, we propose such a model, leveraging frozen pretrained features from the foundation model DINOv2. Al-though these features are significantly more robust than local features trained from scratch, they are inherently coarse. We therefore combine them with specialized ConvNet fine features, creating a precisely localizable feature pyramid. To further improve robustness, we propose a tailored transformer match decoder that…
Citation impact
- FWCI
- 40.01
- Percentile
- 100%
- References
- 0
Authors
5Topics & keywords
- Computer science
- Matching (statistics)
- Feature (linguistics)
- Artificial intelligence
- Pattern recognition (psychology)
- Mathematics
- Statistics