Xiaolong Wang, Allan Jabri, Alexei A. Efros
We introduce a self-supervised method for learning visual correspondence from unlabeled video. The main idea is to use cycle-consistency in time as free supervisory signal for learning visual representations from scratch. At training time, our model learns a feature map representation to be useful for performing cycle-consistent tracking. At test time, we use the acquired representation to find nearest neighbors across space and time. We demonstrate the generalizability of the representation -- without finetuning -- across a range of visual correspondence tasks, including video object segmentation, keypoint tracking, and optical flow. Our approach outperforms previous self-supervised methods and performs competitively with strongly supervised methods.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Video | DAVIS 2017 (val) | F-measure (Mean) | 50 | CycleTime |
| Video | DAVIS 2017 (val) | F-measure (Recall) | 48 | CycleTime |
| Video | DAVIS 2017 (val) | J&F | 48.7 | CycleTime |
| Video | DAVIS 2017 (val) | Jaccard (Mean) | 46.4 | CycleTime |
| Video | DAVIS 2017 (val) | Jaccard (Recall) | 50 | CycleTime |
| Video Object Segmentation | DAVIS 2017 (val) | F-measure (Mean) | 50 | CycleTime |
| Video Object Segmentation | DAVIS 2017 (val) | F-measure (Recall) | 48 | CycleTime |
| Video Object Segmentation | DAVIS 2017 (val) | J&F | 48.7 | CycleTime |
| Video Object Segmentation | DAVIS 2017 (val) | Jaccard (Mean) | 46.4 | CycleTime |
| Video Object Segmentation | DAVIS 2017 (val) | Jaccard (Recall) | 50 | CycleTime |
| Semi-Supervised Video Object Segmentation | DAVIS 2017 (val) | F-measure (Mean) | 50 | CycleTime |
| Semi-Supervised Video Object Segmentation | DAVIS 2017 (val) | F-measure (Recall) | 48 | CycleTime |
| Semi-Supervised Video Object Segmentation | DAVIS 2017 (val) | J&F | 48.7 | CycleTime |
| Semi-Supervised Video Object Segmentation | DAVIS 2017 (val) | Jaccard (Mean) | 46.4 | CycleTime |
| Semi-Supervised Video Object Segmentation | DAVIS 2017 (val) | Jaccard (Recall) | 50 | CycleTime |