M. Bain et al.
Reported on 4 benchmarks across 1 task · 1 paper · 3 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Computer Vision · 4 results
- text-to-video Median Rank · 2021-04-01 · SOTA · 7
- text-to-video R@10 · 2021-04-01 · SOTA · 58.5 · best: 85.1 (InternVideo2-1B)
- text-to-video R@5 · 2021-04-01 · SOTA · 46.4 · best: 80 (InternVideo2-6B)
- text-to-video R@1 · 2021-04-01 · 20.2 · best: 57.9 (InternVideo2-6B)