TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Video/MAD

Video on MAD

Metric: R@1,IoU=0.3 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕R@1,IoU=0.3▼Extra DataPaperDate↕Code
1ReVisionLLM12.7NoReVisionLLM: Recursive Vision-Language Model for...2024-11-22Code
2DeCafNet10.96NoDeCafNet: Delegate and Conquer for Efficient Tem...2025-05-22Code
3DeCafNet10.96NoDeCafNet: Delegate and Conquer for Efficient Tem...2025-05-22Code
4RGNet9.48NoRGNet: A Unified Clip Retrieval and Grounding Ne...2023-12-11Code
5Zero-Shot CLIP + Guidance Model4.65NoLocalizing Moments in Long Video Via Multimodal ...2023-02-26Code
6VLG-Net + Guidance Model4.28NoLocalizing Moments in Long Video Via Multimodal ...2023-02-26Code
7CLIP3.13NoMAD: A Scalable Dataset for Language Grounding i...2021-12-01Code
8VLG-Net2.63NoMAD: A Scalable Dataset for Language Grounding i...2021-12-01Code
9Random Chance0.04NoMAD: A Scalable Dataset for Language Grounding i...2021-12-01Code