Metric: Cap. Avg. R@5 (higher is better)
| # | Model↕ | Cap. Avg. R@5▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | Norton | 95 | No | Multi-granularity Correspondence Learning from L... | 2024-01-30 | Code |
| 2 | TempCLR | 94.6 | No | TempCLR: Temporal Alignment Representation with ... | 2022-12-28 | Code |
| 3 | VideoCLIP | 94.5 | No | VideoCLIP: Contrastive Pre-training for Zero-sho... | 2021-09-28 | Code |
| 4 | MCN | 75 | No | Multimodal Clustering Networks for Self-supervis... | 2021-04-26 | Code |
| 5 | Text-Video Embedding | 74.3 | No | HowTo100M: Learning a Text-Video Embedding by Wa... | 2019-06-07 | Code |
| 6 | MIL-NCE | 68.6 | No | End-to-End Learning of Visual Representations fr... | 2019-12-13 | Code |