Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Video on MAD

Metric: R@1,IoU=0.1 (higher is better)
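
The metric name is compact, so a small illustration may help. The sketch below assumes the common reading of R@1, IoU=0.1 in temporal grounding: the percentage of language queries whose top-ranked predicted segment overlaps the ground-truth segment with a temporal IoU of at least 0.1. The function names (`temporal_iou`, `recall_at_1`), the `(start, end)` tuple format in seconds, and the use of `>=` at the threshold are assumptions made for illustration, not the official MAD evaluation code.

```python
def temporal_iou(pred, gt):
    """Temporal IoU between two (start, end) segments, in seconds."""
    pred_start, pred_end = pred
    gt_start, gt_end = gt
    # Length of the overlap; zero when the segments are disjoint.
    intersection = max(0.0, min(pred_end, gt_end) - max(pred_start, gt_start))
    union = (pred_end - pred_start) + (gt_end - gt_start) - intersection
    return intersection / union if union > 0 else 0.0


def recall_at_1(top1_preds, ground_truths, iou_threshold=0.1):
    """Percentage of queries whose top-1 prediction reaches the IoU threshold.

    Assumption: a prediction counts as a hit when IoU >= iou_threshold.
    """
    if not ground_truths or len(top1_preds) != len(ground_truths):
        raise ValueError("need one top-1 prediction per ground-truth segment")
    hits = sum(
        temporal_iou(pred, gt) >= iou_threshold
        for pred, gt in zip(top1_preds, ground_truths)
    )
    return 100.0 * hits / len(ground_truths)


# Two hypothetical queries: the first prediction overlaps its target,
# the second misses entirely.
preds = [(0.0, 10.0), (0.0, 10.0)]
targets = [(5.0, 15.0), (50.0, 60.0)]
score = recall_at_1(preds, targets)
```

Under this reading, a leaderboard score of 17.3 would mean that roughly 17 in 100 queries are localized with at least 10% temporal overlap by the top prediction.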

Results

| # | Model | R@1,IoU=0.1 | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | ReVisionLLM | 17.3 | No | ReVisionLLM: Recursive Vision-Language Model for... | 2024-11-22 | Code |
| 2 | DeCafNet | 13.25 | No | DeCafNet: Delegate and Conquer for Efficient Tem... | 2025-05-22 | Code |
| 3 | DeCafNet | 13.25 | No | DeCafNet: Delegate and Conquer for Efficient Tem... | 2025-05-22 | Code |
| 4 | RGNet | 12.43 | No | RGNet: A Unified Clip Retrieval and Grounding Ne... | 2023-12-11 | Code |
| 5 | DenoiseLoc | 11.59 | No | Boundary-Denoising for Video Activity Localization | 2023-04-06 | Code |
| 6 | Zero-Shot CLIP + Guidance Model | 9.3 | No | Localizing Moments in Long Video Via Multimodal ... | 2023-02-26 | Code |
| 7 | CLIP | 6.57 | No | MAD: A Scalable Dataset for Language Grounding i... | 2021-12-01 | Code |
| 8 | VLG-Net + Guidance Model | 5.6 | No | Localizing Moments in Long Video Via Multimodal ... | 2023-02-26 | Code |
| 9 | VLG-Net | 3.5 | No | MAD: A Scalable Dataset for Language Grounding i... | 2021-12-01 | Code |
| 10 | Random Chance | 0.09 | No | MAD: A Scalable Dataset for Language Grounding i... | 2021-12-01 | Code |