Abstract

Feature matching is an important computer vision task that involves estimating correspondences between two images of a 3D scene, and dense methods estimate all such correspondences. The aim is to learn a robust model, i.e., a model able to match under challenging real-world changes. In this work, we propose such a model, leveraging frozen pretrained features from the foundation model DINOv2. Al-though these features are significantly more robust than local features trained from scratch, they are inherently coarse. We therefore combine them with specialized ConvNet fine features, creating a precisely localizable feature pyramid. To further improve robustness, we propose a tailored transformer match decoder that…

Citation impact

178
total citations
FWCI
40.01
Percentile
100%
References
0
Citations per year

Authors

5

Topics & keywords

Keywords
  • Computer science
  • Matching (statistics)
  • Feature (linguistics)
  • Artificial intelligence
  • Pattern recognition (psychology)
  • Mathematics
  • Statistics
No related works found for this paper.

Funding