Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Video on ActivityNet Captions

Metric: R@1,IoU=0.7 (higher is better)
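The metric name reads as recall at rank 1 with a temporal IoU threshold of 0.7: a query counts as a hit when the model's top-ranked segment overlaps the ground-truth segment with IoU of at least 0.7. A minimal sketch of that computation follows. The function names, the (start, end) tuple representation, the `>=` comparison, and reporting the score as a percentage are all assumptions made for illustration, not the evaluation code used by any listed paper.

```python
def temporal_iou(pred, gt):
    """Intersection over union of two (start, end) segments, in seconds."""
    inter = max(0.0, min(pred[1], gt[1]) - max(pred[0], gt[0]))
    union = (pred[1] - pred[0]) + (gt[1] - gt[0]) - inter
    return inter / union if union > 0 else 0.0


def recall_at_1(top1_preds, ground_truths, iou_threshold=0.7):
    """Percentage of queries whose top-1 segment reaches the IoU threshold.

    Assumption: one top-ranked predicted segment and one ground-truth
    segment per query, given in the same order.
    """
    if len(top1_preds) != len(ground_truths):
        raise ValueError("need exactly one prediction per ground-truth segment")
    if not ground_truths:
        return 0.0
    hits = sum(
        temporal_iou(p, g) >= iou_threshold
        for p, g in zip(top1_preds, ground_truths)
    )
    return 100.0 * hits / len(ground_truths)


# Hypothetical toy data: three queries, only the first is a hit at IoU=0.7.
preds = [(0.0, 10.0), (5.0, 20.0), (30.0, 40.0)]
gts = [(0.0, 10.0), (10.0, 20.0), (100.0, 110.0)]
score = recall_at_1(preds, gts)
```

In the toy data the second prediction overlaps its target with IoU of 10/15, just under the threshold, which shows how strict the 0.7 cutoff is compared with looser settings such as 0.5.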


Results

# | Model | R@1,IoU=0.7 | Extra Data | Paper | Date | Code
1 | GVL (paragraph-level) | 38.55 | No | Learning Grounded Vision-Language Representation... | 2023-03-11 | Code
2 | LLaVA-MR | 35.68 | No | LLaVA-MR: Large Language-and-Vision Assistant fo... | 2024-11-21 | Code
3 | UnLoc-L | 30.2 | No | UnLoc: A Unified Framework for Video Localizatio... | 2023-08-21 | Code
4 | VLG-Net | 29.82 | No | VLG-Net: Video-Language Graph Matching Network f... | 2020-11-19 | Code
5 | UnLoc-B | 29.7 | No | UnLoc: A Unified Framework for Video Localizatio... | 2023-08-21 | Code
6 | GVL | 29.69 | No | Learning Grounded Vision-Language Representation... | 2023-03-11 | Code
7 | DRN | 24.36 | No | Dense Regression Network for Video Grounding | 2020-04-07 | Code