Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Video on LSMDC

Metric: video-to-text R@10 (higher is better)
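
The page does not define the metric, so the following is only a sketch under the common retrieval convention: video-to-text R@10 is the percentage of query videos whose ground-truth caption appears among the 10 highest-scoring captions. The function name `recall_at_k`, the one-caption-per-video assumption, and the toy similarity matrix are illustrative assumptions, not taken from any paper listed here. Individual papers may evaluate differently.

```python
def recall_at_k(sim, k=10):
    """Return Recall@K (in percent) for video-to-text retrieval.

    sim[i][j] is the similarity between video i and caption j.
    Assumption: caption i is the single correct match for video i.
    """
    hits = 0
    for i, row in enumerate(sim):
        # Zero-based rank of the correct caption:
        # the number of captions scored strictly higher than it.
        rank = sum(1 for s in row if s > row[i])
        if rank < k:
            hits += 1
    return 100.0 * hits / len(sim)


# Toy 3x3 similarity matrix (made-up numbers, not model output).
toy_sim = [
    [0.9, 0.1, 0.2],  # video 0: correct caption ranked first
    [0.8, 0.3, 0.5],  # video 1: correct caption ranked third
    [0.2, 0.7, 0.6],  # video 2: correct caption ranked second
]
print(recall_at_k(toy_sim, k=1))
print(recall_at_k(toy_sim, k=2))
```

Because the score is a percentage of queries answered within the top K, higher values are better, which matches the ordering used on this leaderboard.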

Results

| # | Model | video-to-text R@10 | Extra Data | Paper | Date | Code |
|---|-------|--------------------|------------|-------|------|------|
| 1 | HunYuan_tvr (huge) | 91.8 | Yes | Tencent Text-Video Retrieval: Hierarchical Cross... | 2022-04-07 | - |
| 2 | UMT-L (ViT-L/16) | 71.5 | Yes | Unmasked Teacher: Towards Training-Efficient Vid... | 2023-03-28 | Code |
| 3 | vid-TLDR (UMT-L) | 63.6 | Yes | vid-TLDR: Training Free Token merging for Light-... | 2024-03-20 | Code |
| 4 | CenterCLIP (ViT-B/16) | 55.8 | Yes | CenterCLIP: Token Clustering for Efficient Text-... | 2022-05-02 | Code |
| 5 | HunYuan_tvr | 55.7 | Yes | Tencent Text-Video Retrieval: Hierarchical Cross... | 2022-04-07 | - |
| 6 | EMCL-Net++ | 54.4 | No | Expectation-Maximization Contrastive Learning fo... | 2022-11-21 | Code |
| 7 | DiffusionRet | 51.5 | No | DiffusionRet: Generative Text-Video Retrieval wi... | 2023-03-17 | Code |
| 8 | X-Pool | 51.2 | Yes | X-Pool: Cross-Modal Language-Video Attention for... | 2022-03-28 | Code |
| 9 | EMCL-Net | 49.2 | No | Expectation-Maximization Contrastive Learning fo... | 2022-11-21 | Code |
| 10 | CLIP | 22.1 | No | A Straightforward Framework For Video Retrieval ... | 2021-02-24 | Code |