Video Retrieval on Condensed Movies

Metric: text-to-video R@5 (higher is better)

LeaderboardDataset
Loading chart...
#Modeltext-to-video R@5Extra DataPaperDateCode
1TESTA (ViT-B/16)46.5YesTESTA: Temporal-Spatial Token Aggregation for Lo...2023-10-29Code
2VINDLU36.4YesVindLU: A Recipe for Effective Video-and-Languag...2022-12-09Code
3LF-VILA 32.5YesLong-Form Video-Language Pre-Training with Multi...2022-10-12Code