TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Video/ActivityNet Captions

Video on ActivityNet Captions

Metric: R@1,IoU=0.5 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕R@1,IoU=0.5▼Extra DataPaperDate↕Code
1GVL (paragraph-level)60.67NoLearning Grounded Vision-Language Representation...2023-03-11Code
2LLaVA-MR55.16NoLLaVA-MR: Large Language-and-Vision Assistant fo...2024-11-21Code
3GVL49.18NoLearning Grounded Vision-Language Representation...2023-03-11Code
4UnLoc-L48.3NoUnLoc: A Unified Framework for Video Localizatio...2023-08-21Code
5UnLoc-B48NoUnLoc: A Unified Framework for Video Localizatio...2023-08-21Code
6VLG-Net46.32NoVLG-Net: Video-Language Graph Matching Network f...2020-11-19Code
7DRN45.45NoDense Regression Network for Video Grounding2020-04-07Code