TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Moment Retrieval/QVHighlights

Moment Retrieval on QVHighlights

Metric: mAP (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕mAP▼Extra DataPaperDate↕Code
1SG-DETR (w/ PT)58.8YesSaliency-Guided DETR for Moment Retrieval and Hi...2024-10-02Code
2SG-DETR54.1NoSaliency-Guided DETR for Moment Retrieval and Hi...2024-10-02Code
3LLaVA-MR52.73NoLLaVA-MR: Large Language-and-Vision Assistant fo...2024-11-21Code
4FlashVTG52NoFlashVTG: Feature Layering and Adaptive Score Ha...2024-12-18Code
5InternVideo2-6B49.24YesInternVideo2: Scaling Foundation Models for Mult...2024-03-22Code
6SG-DETR (ZS)48.3YesSaliency-Guided DETR for Moment Retrieval and Hi...2024-10-02Code
7CG-DETR (w/ PT)47.97YesCorrelation-Guided Query-Dependency Calibration ...2023-11-15Code
8VideoLights-B-pt47.94YesVideoLights: Feature Refinement and Cross-Task A...2024-12-02Code
9LA-DETR47.93NoLength-Aware DETR for Robust Moment Retrieval2024-12-30Code
10BAM-DETR (w/ audio)46.91NoBAM-DETR: Boundary-Aligned Moment Detection Tran...2023-11-30Code
11BAM-DETR (w/ PT ASR Captions)46.67YesBAM-DETR: Boundary-Aligned Moment Detection Tran...2023-11-30Code
12LD-DETR46.41NoLD-DETR: Loop Decoder DEtection TRansformer for ...2025-01-18Code
13R^2-Tuning46.17No$R^2$-Tuning: Efficient Image-to-Video Transfer ...2024-03-31Code
14BAM-DETR45.36NoBAM-DETR: Boundary-Aligned Moment Detection Tran...2023-11-30Code
15video-mamba-suite45.18NoVideo Mamba Suite: State Space Model as a Versat...2024-03-14Code
16LLMEPET44.05NoPrior Knowledge Integration via LLM Encoding and...2024-07-21Code
17UVCOM (w/ PT ASR Captions)43.8YesBridging the Gap: A Unified Video Comprehension ...2023-11-28Code
18UniVTG (w/ PT)43.63YesUniVTG: Towards Unified Video-Language Temporal ...2023-07-31Code
19UVCOM43.18NoBridging the Gap: A Unified Video Comprehension ...2023-11-28Code
20CG-DETR42.86NoCorrelation-Guided Query-Dependency Calibration ...2023-11-15Code
21QD-DETR (w/ PT)40.62NoQuery-Dependent Video Representation for Moment ...2023-03-24Code
22QD-DETR (w/ audio)40.19NoQuery-Dependent Video Representation for Moment ...2023-03-24Code
23BM-DETR40.08NoBackground-aware Moment Detection for Video Mome...2023-06-05Code
24QD-DETR (only Video w/ PT ASR Captions)40NoQuery-Dependent Video Representation for Moment ...2023-03-24Code
25QD-DETR (only Video)39.86NoQuery-Dependent Video Representation for Moment ...2023-03-24Code
26UMT (w/ audio + PT ASR Cpations)38.08NoUMT: Unified Multi-modal Transformers for Joint ...2022-03-23Code
27Moment-DETR (w/ PT ASR Cpations)36.14NoQVHighlights: Detecting Moments and Highlights i...2021-07-20Code
28UMT36.12NoUMT: Unified Multi-modal Transformers for Joint ...2022-03-23Code
29UniVTG35.47NoUniVTG: Towards Unified Video-Language Temporal ...2023-07-31Code
30SeViLA-Localizer32.3No---
31VTG-GPT30.91NoVTG-GPT: Tuning-Free Zero-Shot Video Temporal Gr...2024-03-04Code