Zero-Shot Video Retrieval on MSR-VTT-full
Metric: text-to-video R@5 (higher is better)
LeaderboardDataset
Loading chart...
Results
Submit a result| # | Model↕ | text-to-video R@5▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | InternVL-G | 70.5 | Yes | InternVL: Scaling up Vision Foundation Models an... | 2023-12-21 | Code |
| 2 | InternVL-C | 68.2 | No | InternVL: Scaling up Vision Foundation Models an... | 2023-12-21 | Code |
| 3 | VideoCoCa | 57.8 | Yes | VideoCoCa: Video-Text Modeling with Zero-Shot Tr... | 2022-12-09 | - |