TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Video Retrieval/YouCook2

Video Retrieval on YouCook2

Metric: text-to-video Median Rank (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕text-to-video Median Rank▼Extra DataPaperDate↕Code
1Satar et al.77NoSemantic Role Aware Correlation Transformer for ...2022-06-26Code
2HGLMM FV CCA75No---
3RoME53NoRoME: Role-aware Mixture-of-Expert Transformer f...2022-06-26Code
4Text-Video Embedding24NoHowTo100M: Learning a Text-Video Embedding by Wa...2019-06-07Code
5COOT9NoCOOT: Cooperative Hierarchical Transformer for V...2020-11-01Code
6TACo4YesTACo: Token-aware Cascade Contrastive Learning f...2021-08-23-
7UniVL4YesUniVL: A Unified Video and Language Pre-Training...2020-02-15Code
8VLM4YesVLM: Task-agnostic Video-Language Model Pre-trai...2021-05-20Code
9UniVL + MELTR3NoMELTR: Meta Loss Transformer for Learning to Fin...2023-03-23Code
10MDMMT-23YesMDMMT-2: Multidomain Multimodal Transformer for ...2022-03-14-