Video Retrieval on Condensed Movies

Metric: text-to-video R@10 (higher is better)
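
As a rough illustration of how a text-to-video R@10 figure can be computed, here is a minimal sketch. It is not the evaluation code of any model listed here. It assumes a square similarity matrix in which entry (i, j) scores text query i against video j, and that the correct video for query i is video i. The function name `recall_at_k` and the toy scores are hypothetical.

```python
import numpy as np


def recall_at_k(similarity, k=10):
    """Percentage of text queries whose correct video ranks in the top k.

    Assumption: similarity[i, j] scores text query i against video j,
    and the ground-truth video for query i is video i (the diagonal).
    """
    similarity = np.asarray(similarity, dtype=float)
    # Score each query gives to its own (correct) video.
    correct = np.diag(similarity)
    # 0-based rank: how many videos score strictly higher than the correct one.
    ranks = (similarity > correct[:, None]).sum(axis=1)
    # A query counts as a hit when its correct video is among the top k.
    return 100.0 * float(np.mean(ranks < k))


# Toy example: 3 text queries scored against 3 videos.
toy_scores = [
    [0.9, 0.1, 0.2],
    [0.8, 0.3, 0.5],
    [0.1, 0.2, 0.7],
]
r_at_1 = recall_at_k(toy_scores, k=1)
r_at_3 = recall_at_k(toy_scores, k=3)
```

In the toy example, two of the three queries rank their correct video first, so `r_at_1` is about 66.7, and `r_at_3` is 100.0 because there are only three candidates. The result is returned as a percentage, which is how the scores on this page appear to be reported.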

#  Model             text-to-video R@10  Extra Data  Paper                                                Date        Code
1  TESTA (ViT-B/16)  55.1                Yes         TESTA: Temporal-Spatial Token Aggregation for Lo...  2023-10-29  Code
2  VINDLU            44.3                Yes         VindLU: A Recipe for Effective Video-and-Languag...  2022-12-09  Code
3  LF-VILA           41.8                Yes         Long-Form Video-Language Pre-Training with Multi...  2022-10-12  Code