Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Moment Retrieval on Charades-STA

Metric: R@1 IoU=0.7 (higher is better)
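For readers unfamiliar with the metric, the following is a minimal sketch of how R@1 at a temporal IoU threshold is commonly computed. It assumes each query has one ground-truth moment and one top-ranked predicted moment, both given as (start, end) in seconds, and that a prediction counts as a hit when its IoU is at least 0.7. The function names `temporal_iou` and `recall_at_1` are illustrative, not taken from any listed paper's code, and individual papers may differ in details such as tie handling.

```python
def temporal_iou(pred, gt):
    """Intersection over union of two (start, end) time intervals."""
    # Overlap is zero when the intervals do not intersect.
    inter = max(0.0, min(pred[1], gt[1]) - max(pred[0], gt[0]))
    union = (pred[1] - pred[0]) + (gt[1] - gt[0]) - inter
    return inter / union if union > 0 else 0.0


def recall_at_1(top1_preds, gts, iou_threshold=0.7):
    """Percentage of queries whose top-1 prediction reaches the IoU threshold.

    Assumption: a hit is IoU >= iou_threshold (inclusive).
    """
    if not gts:
        raise ValueError("need at least one ground-truth moment")
    hits = sum(
        temporal_iou(p, g) >= iou_threshold for p, g in zip(top1_preds, gts)
    )
    return 100.0 * hits / len(gts)


# Hypothetical example: the first prediction overlaps well, the second does not.
preds = [(2.0, 10.0), (0.0, 4.0)]
truth = [(0.0, 10.0), (0.0, 10.0)]
score = recall_at_1(preds, truth)
```

Under these assumptions, the first pair has IoU 0.8 and the second 0.4, so one of the two queries counts as a hit.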


Results

| # | Model | R@1 IoU=0.7 | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | SG-DETR (w/ PT) | 52.8 | Yes | Saliency-Guided DETR for Moment Retrieval and Hi... | 2024-10-02 | Code |
| 2 | FlashVTG | 49.87 | No | FlashVTG: Feature Layering and Adaptive Score Ha... | 2024-12-18 | Code |
| 3 | LLaVA-MR | 49.58 | No | LLaVA-MR: Large Language-and-Vision Assistant fo... | 2024-11-21 | Code |
| 4 | SG-DETR | 49.5 | No | Saliency-Guided DETR for Moment Retrieval and Hi... | 2024-10-02 | Code |
| 5 | InternVideo2-6B | 48.95 | No | InternVideo2: Scaling Foundation Models for Mult... | 2024-03-22 | Code |
| 6 | InternVideo2-1B | 45.03 | No | InternVideo2: Scaling Foundation Models for Mult... | 2024-03-22 | Code |
| 7 | UniMD+Sync. | 44.46 | No | UniMD: Towards Unifying Moment Retrieval and Tem... | 2024-04-07 | Code |
| 8 | VideoChat-T (FT) | 43 | Yes | TimeSuite: Improving MLLMs for Long Video Unders... | 2024-10-25 | Code |
| 9 | LD-DETR | 41.56 | No | LD-DETR: Loop Decoder DEtection TRansformer for ... | 2025-01-18 | Code |
| 10 | VideoLights-B-pt | 41.05 | Yes | VideoLights: Feature Refinement and Cross-Task A... | 2024-12-02 | Code |
| 11 | BAM-DETR | 39.38 | No | BAM-DETR: Boundary-Aligned Moment Detection Tran... | 2023-11-30 | Code |
| 12 | UnLoc-L | 38.4 | No | UnLoc: A Unified Framework for Video Localizatio... | 2023-08-21 | Code |
| 13 | BM-DETR | 38.33 | No | Background-aware Moment Detection for Video Mome... | 2023-06-05 | Code |
| 14 | UVCOM | 36.64 | No | Bridging the Gap: A Unified Video Comprehension ... | 2023-11-28 | Code |
| 15 | LLMEPET | 36.49 | No | Prior Knowledge Integration via LLM Encoding and... | 2024-07-21 | Code |
| 16 | CG-DETR | 36.34 | No | Correlation-Guided Query-Dependency Calibration ... | 2023-11-15 | Code |
| 17 | video-mamba-suite | 36.05 | No | Video Mamba Suite: State Space Model as a Versat... | 2024-03-14 | Code |
| 18 | UnLoc-B | 35.4 | No | UnLoc: A Unified Framework for Video Localizatio... | 2023-08-21 | Code |
| 19 | Moment-DETR w/ PT (on 10K HowTo100M videos) | 34.17 | No | QVHighlights: Detecting Moments and Highlights i... | 2021-07-20 | Code |
| 20 | QD-DETR (Only Video) | 32.55 | No | Query-Dependent Video Representation for Moment ... | 2023-03-24 | Code |
| 21 | Moment-DETR | 31.37 | No | QVHighlights: Detecting Moments and Highlights i... | 2021-07-20 | Code |
| 22 | UMT (VA) | 29.25 | No | UMT: Unified Multi-modal Transformers for Joint ... | 2022-03-23 | Code |
| 23 | SimVTP | 26.3 | No | SimVTP: Simple Video Text Pre-training with Mask... | 2022-12-07 | - |
| 24 | UMT (VO) | 26.16 | No | UMT: Unified Multi-modal Transformers for Joint ... | 2022-03-23 | Code |
| 25 | VideoChat-T (ZS) | 24 | Yes | TimeSuite: Improving MLLMs for Long Video Unders... | 2024-10-25 | Code |