Zero-Shot Video Retrieval on VATEX

Metric: video-to-text R@5 (higher is better)

LeaderboardDataset
Loading chart...
#Modelvideo-to-text R@5Extra DataPaperDateCode
1InternVideo2-6B97.9YesInternVideo2: Scaling Foundation Models for Mult...2024-03-22Code
2InternVideo2-1B97.6YesInternVideo2: Scaling Foundation Models for Mult...2024-03-22Code
3VideoCoCa93.2YesVideoCoCa: Video-Text Modeling with Zero-Shot Tr...2022-12-09-