Metric: Cap. Avg. R@1 (higher is better)
| # | Model | Cap. Avg. R@1 | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | Norton | 75.5 | No | Multi-granularity Correspondence Learning from L... | 2024-01-30 | Code |
| 2 | TempCLR | 74.5 | No | TempCLR: Temporal Alignment Representation with ... | 2022-12-28 | Code |
| 3 | VideoCLIP | 74.5 | No | VideoCLIP: Contrastive Pre-training for Zero-sho... | 2021-09-28 | Code |
| 4 | MCN | 53.4 | No | Multimodal Clustering Networks for Self-supervis... | 2021-04-26 | Code |
| 5 | Text-Video Embedding | 46.6 | No | HowTo100M: Learning a Text-Video Embedding by Wa... | 2019-06-07 | Code |
| 6 | MIL-NCE | 43.1 | No | End-to-End Learning of Visual Representations fr... | 2019-12-13 | Code |