Learning Correspondence From the Cycle-Consistency of Time
Carnegie Mellon University · Berkeley College · +1 more institution
Abstract
We introduce a self-supervised method for learning visual correspondence from unlabeled video. The main idea is to use cycle-consistency in time as free supervisory signal for learning visual representations from scratch. At training time, our model learns a feature map representation to be useful for performing cycle-consistent tracking. At test time, we use the acquired representation to find nearest neighbors across space and time. We demonstrate the generalizability of the representation -- without finetuning -- across a range of visual correspondence tasks, including video object segmentation, keypoint tracking, and optical flow. Our approach outperforms previous self-supervised methods and performs…
Citation impact
- FWCI
- 32.25
- Percentile
- 100%
- References
- 118
Authors
3Topics & keywords
- Computer science
- Artificial intelligence
- Generalizability theory
- Representation (politics)
- Feature (linguistics)
- Consistency (knowledge bases)
- Pattern recognition (psychology)
- Segmentation
- Quality Education