Zero-Shot Video Retrieval on MSR-VTT-full

Metric: video-to-text R@5 (higher is better)
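
For readers unfamiliar with the metric, here is a minimal sketch of how video-to-text Recall@K is commonly computed: each video is a query, all candidate texts are ranked by similarity, and the query counts as a hit if any correct caption appears in the top K. The function name `recall_at_k`, the similarity matrix `sim`, and the `positives` mapping are hypothetical names chosen for illustration. The exact evaluation protocol behind the numbers on this page is not stated here and may differ.

```python
def recall_at_k(sim, positives, k=5):
    """Video-to-text Recall@K as a percentage.

    sim[i][j]    -- similarity between video i and text j (hypothetical input)
    positives[i] -- set of text indices that correctly describe video i
    """
    hits = 0
    for i, row in enumerate(sim):
        # Rank all text indices by descending similarity to video i.
        ranked = sorted(range(len(row)), key=lambda j: row[j], reverse=True)
        # The query is a hit if any correct text is among the top k.
        if any(j in positives[i] for j in ranked[:k]):
            hits += 1
    return 100.0 * hits / len(sim)


# Toy example: 3 videos, 3 texts, one correct text per video (an assumption).
sim = [
    [0.9, 0.1, 0.2],
    [0.8, 0.3, 0.1],
    [0.2, 0.1, 0.7],
]
positives = [{0}, {1}, {2}]
print(recall_at_k(sim, positives, k=1))
print(recall_at_k(sim, positives, k=2))
```

In the toy example, video 1 ranks its correct text second, so it is a miss at K=1 and a hit at K=2.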

#  Model       video-to-text R@5  Extra Data  Paper                                                Date        Code
1  VideoCoCa   85.2               Yes         VideoCoCa: Video-Text Modeling with Zero-Shot Tr...  2022-12-09  -
2  InternVL-G  65.9               Yes         InternVL: Scaling up Vision Foundation Models an...  2023-12-21  Code
3  InternVL-C  63.1               No          InternVL: Scaling up Vision Foundation Models an...  2023-12-21  Code