Zero-Shot Video Retrieval on MSR-VTT-full

Metric: text-to-video R@1 (higher is better)

LeaderboardDataset
Loading chart...
#Modeltext-to-video R@1Extra DataPaperDateCode
1InternVL-G46.3YesInternVL: Scaling up Vision Foundation Models an...2023-12-21Code
2InternVL-C44.7NoInternVL: Scaling up Vision Foundation Models an...2023-12-21Code
3VideoCoCa34.3YesVideoCoCa: Video-Text Modeling with Zero-Shot Tr...2022-12-09-