TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Video/ActivityNet

Video on ActivityNet

Metric: text-to-video Median Rank (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕text-to-video Median Rank▼Extra DataPaperDate↕Code
1Collaborative Experts6NoUse What You Have: Video Retrieval Using Represe...2019-07-31Code
2MMT5NoMulti-modal Transformer for Video Retrieval2020-07-21Code
3HD-VILA4NoAdvancing High-Resolution Video-Language Represe...2021-11-19Code
4MMT-Pretrained3.3YesMulti-modal Transformer for Video Retrieval2020-07-21Code
5TACo3YesTACo: Token-aware Cascade Contrastive Learning f...2021-08-23-
6DiffusionRet+QB-Norm2NoDiffusionRet: Generative Text-Video Retrieval wi...2023-03-17Code
7CenterCLIP (ViT-B/16)2YesCenterCLIP: Token Clustering for Efficient Text-...2022-05-02Code
8DiffusionRet2NoDiffusionRet: Generative Text-Video Retrieval wi...2023-03-17Code
9HBI2NoVideo-Text as Game Players: Hierarchical Banzhaf...2023-03-25Code
10CLIP4Clip2NoCLIP4Clip: An Empirical Study of CLIP for End to...2021-04-18Code
11CLIP-ViP1YesCLIP-ViP: Adapting Pre-trained Image-Text Model ...2022-09-14Code
12HunYuan_tvr1YesTencent Text-Video Retrieval: Hierarchical Cross...2022-04-07-
13DMAE (ViT-B/32)1NoDual-Modal Attention-Enhanced Text-Video Retriev...2023-09-20Code
14CAMoE1YesImproving Video-Text Retrieval by Multi-Stream C...2021-09-09Code