M. Bain et al.

Reported on 4 benchmarks across 1 task · 1 paper · 3 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Computer Vision · 4 results

Zero-Shot Video Retrieval on DiDeMo
text-to-video Median Rank · 2021-04-01
7
SOTA
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval arXiv:2104.00650
Zero-Shot Video Retrieval on DiDeMo
text-to-video R@10 · 2021-04-01
58.5
best: 85.1 (InternVideo2-1B)
SOTA
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval arXiv:2104.00650
Zero-Shot Video Retrieval on DiDeMo
text-to-video R@5 · 2021-04-01
46.4
best: 80 (InternVideo2-6B)
SOTA
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval arXiv:2104.00650
Zero-Shot Video Retrieval on DiDeMo
text-to-video R@1 · 2021-04-01
20.2
best: 57.9 (InternVideo2-6B)
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval arXiv:2104.00650
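
For readers unfamiliar with the metrics listed above, the following is a minimal sketch of how R@K and Median Rank are commonly computed for text-to-video retrieval. It is not taken from the paper or from this page. It assumes a square similarity matrix in which row i holds the scores of text query i against every video, and in which the correct video for query i sits at index i. The function name `retrieval_metrics` and the toy matrix are hypothetical, and ties are resolved optimistically, which is one of several possible conventions.

```python
from statistics import median


def retrieval_metrics(similarity, ks=(1, 5, 10)):
    """Compute R@K (in percent) and median rank for text-to-video retrieval.

    Assumption: similarity[i][j] is the score of text query i against
    video j, and the ground-truth video for query i is at index i.
    """
    ranks = []
    for i, row in enumerate(similarity):
        target = row[i]
        # Rank is 1 plus the number of videos scored strictly higher
        # than the correct one (ties are counted in the query's favor).
        rank = 1 + sum(1 for score in row if score > target)
        ranks.append(rank)

    n = len(ranks)
    # R@K: percentage of queries whose correct video is in the top K.
    metrics = {f"R@{k}": 100.0 * sum(r <= k for r in ranks) / n for k in ks}
    # Median Rank: median position of the correct video (lower is better).
    metrics["MedR"] = median(ranks)
    return metrics


# Toy example with three queries and three videos (illustrative values only).
toy_similarity = [
    [0.9, 0.1, 0.2],  # correct video ranked 1st
    [0.8, 0.3, 0.5],  # correct video ranked 3rd
    [0.1, 0.2, 0.7],  # correct video ranked 1st
]
toy_metrics = retrieval_metrics(toy_similarity)
print(toy_metrics)
```

Higher is better for R@K and lower is better for Median Rank, so the Median Rank entry above is read in the opposite direction from the R@1, R@5, and R@10 entries.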