Video on Condensed Movies

Metric: text-to-video R@1 (higher is better)
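
For readers unfamiliar with the metric, text-to-video R@1 (Recall@1) is the percentage of text queries for which the correct video is ranked first. The sketch below is illustrative only: the function name `recall_at_k`, the toy score matrix, and the assumption of exactly one ground-truth video per query (query i matches video i) are not taken from the leaderboard or from any of the listed papers.

```python
def recall_at_k(similarity, k=1):
    """Return Recall@k as a percentage.

    similarity[i][j] is the score between text query i and video j.
    Assumption: the ground-truth video for query i is video i.
    """
    hits = 0
    for i, row in enumerate(similarity):
        # Rank video indices from highest to lowest score.
        ranked = sorted(range(len(row)), key=lambda j: row[j], reverse=True)
        if i in ranked[:k]:
            hits += 1
    return 100.0 * hits / len(similarity)


# Toy 3x3 score matrix (hypothetical values).
scores = [
    [0.9, 0.1, 0.3],  # query 0: video 0 ranked first (hit)
    [0.2, 0.4, 0.8],  # query 1: video 2 ranked first (miss)
    [0.1, 0.2, 0.7],  # query 2: video 2 ranked first (hit)
]
r1 = recall_at_k(scores, k=1)
r2 = recall_at_k(scores, k=2)
```

Here two of the three queries retrieve their own video at rank 1, so `r1` is about 66.7; relaxing to `k=2` also counts query 1.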

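| # | Model | text-to-video R@1 | Extra Data | Paper | Date | Code |
|---|-------|-------------------|------------|-------|------|------|
| 1 | TESTA (ViT-B/16) | 24.9 | Yes | TESTA: Temporal-Spatial Token Aggregation for Lo... | 2023-10-29 | Code |
| 2 | VINDLU | 18.4 | Yes | VindLU: A Recipe for Effective Video-and-Languag... | 2022-12-09 | Code |
| 3 | LF-VILA | 13.6 | Yes | Long-Form Video-Language Pre-Training with Multi... | 2022-10-12 | Code |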