TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Moment Retrieval/Charades-STA

Moment Retrieval on Charades-STA

Metric: R@1 IoU=0.5 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕R@1 IoU=0.5▼Extra DataPaperDate↕Code
1SG-DETR (w/ PT)71.1YesSaliency-Guided DETR for Moment Retrieval and Hi...2024-10-02Code
2LLaVA-MR70.65NoLLaVA-MR: Large Language-and-Vision Assistant fo...2024-11-21Code
3FlashVTG70.32NoFlashVTG: Feature Layering and Adaptive Score Ha...2024-12-18Code
4SG-DETR70.2NoSaliency-Guided DETR for Moment Retrieval and Hi...2024-10-02Code
5InternVideo2-6B70.03NoInternVideo2: Scaling Foundation Models for Mult...2024-03-22Code
6InternVideo2-1B68.36NoInternVideo2: Scaling Foundation Models for Mult...2024-03-22Code
7VideoChat-T (FT)67.1YesTimeSuite: Improving MLLMs for Long Video Unders...2024-10-25Code
8UniMD+Sync.63.98NoUniMD: Towards Unifying Moment Retrieval and Tem...2024-04-07Code
9LD-DETR62.58NoLD-DETR: Loop Decoder DEtection TRansformer for ...2025-01-18Code
10VideoLights-B-pt61.96YesVideoLights: Feature Refinement and Cross-Task A...2024-12-02Code
11UnLoc-L60.8NoUnLoc: A Unified Framework for Video Localizatio...2023-08-21Code
12BAM-DETR59.95NoBAM-DETR: Boundary-Aligned Moment Detection Tran...2023-11-30Code
13BM-DETR59.48NoBackground-aware Moment Detection for Video Mome...2023-06-05Code
14UVCOM59.25NoBridging the Gap: A Unified Video Comprehension ...2023-11-28Code
15CG-DETR58.44NoCorrelation-Guided Query-Dependency Calibration ...2023-11-15Code
16LLMEPET58.31NoPrior Knowledge Integration via LLM Encoding and...2024-07-21Code
17UnLoc-B58.1NoUnLoc: A Unified Framework for Video Localizatio...2023-08-21Code
18QD-DETR (Only Video)57.31NoQuery-Dependent Video Representation for Moment ...2023-03-24Code
19video-mamba-suite57.18NoVideo Mamba Suite: State Space Model as a Versat...2024-03-14Code
20Moment-DETR w/ PT (on 10K HowTo100M videos)55.65NoQVHighlights: Detecting Moments and Highlights i...2021-07-20Code
21Moment-DETR53.63NoQVHighlights: Detecting Moments and Highlights i...2021-07-20Code
22UMT (VO)49.35NoUMT: Unified Multi-modal Transformers for Joint ...2022-03-23Code
23VideoChat-T (ZS)48.7YesTimeSuite: Improving MLLMs for Long Video Unders...2024-10-25Code
24UMT (VA)48.31NoUMT: Unified Multi-modal Transformers for Joint ...2022-03-23Code
25SimVTP44.7NoSimVTP: Simple Video Text Pre-training with Mask...2022-12-07-