Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Computer Vision
/
Moment Retrieval
/
QVHighlights
Moment Retrieval on QVHighlights
Metric: R@1 IoU=0.5 (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Hide extra data
Export CSV
Sort:
R@1 IoU=0.5 (best first)
R@1 IoU=0.5 (worst first)
Date (newest first)
Date (oldest first)
Model name (A→Z)
#
Model
↕
R@1 IoU=0.5
▼
Extra Data
Paper
Date
↕
Code
1
LLaVA-MR
76.59
No
LLaVA-MR: Large Language-and-Vision Assistant fo...
2024-11-21
Code
2
SG-DETR (w/ PT)
74.2
Yes
Saliency-Guided DETR for Moment Retrieval and Hi...
2024-10-02
Code
3
SG-DETR
72.2
No
Saliency-Guided DETR for Moment Retrieval and Hi...
2024-10-02
Code
4
InternVideo2-6B
71.42
Yes
InternVideo2: Scaling Foundation Models for Mult...
2024-03-22
Code
5
FlashVTG
70.69
No
FlashVTG: Feature Layering and Adaptive Score Ha...
2024-12-18
Code
6
VideoLights-B-pt
70.36
Yes
VideoLights: Feature Refinement and Cross-Task A...
2024-12-02
Code
7
CG-DETR (w/ PT)
68.48
Yes
Correlation-Guided Query-Dependency Calibration ...
2023-11-15
Code
8
R^2-Tuning
68.03
No
$R^2$-Tuning: Efficient Image-to-Video Transfer ...
2024-03-31
Code
9
LD-DETR
66.8
No
LD-DETR: Loop Decoder DEtection TRansformer for ...
2025-01-18
Code
10
LLMEPET
66.73
No
Prior Knowledge Integration via LLM Encoding and...
2024-07-21
Code
11
video-mamba-suite
66.65
No
Video Mamba Suite: State Space Model as a Versat...
2024-03-14
Code
12
UnLoc-L
66.1
No
UnLoc: A Unified Framework for Video Localizatio...
2023-08-21
Code
13
UniVTG (w/ PT)
65.43
Yes
UniVTG: Towards Unified Video-Language Temporal ...
2023-07-31
Code
14
CG-DETR
65.43
No
Correlation-Guided Query-Dependency Calibration ...
2023-11-15
Code
15
UVCOM (w/ PT ASR Captions)
64.53
Yes
Bridging the Gap: A Unified Video Comprehension ...
2023-11-28
Code
16
UnLoc-B
64.5
No
UnLoc: A Unified Framework for Video Localizatio...
2023-08-21
Code
17
QD-DETR (w/ PT)
64.1
No
Query-Dependent Video Representation for Moment ...
2023-03-24
Code
18
BAM-DETR (w/ audio)
64.07
No
BAM-DETR: Boundary-Aligned Moment Detection Tran...
2023-11-30
Code
19
LA-DETR
63.94
No
Length-Aware DETR for Robust Moment Retrieval
2024-12-30
Code
20
BAM-DETR (w/ PT ASR Captions)
63.88
Yes
BAM-DETR: Boundary-Aligned Moment Detection Tran...
2023-11-30
Code
21
UVCOM
63.55
No
Bridging the Gap: A Unified Video Comprehension ...
2023-11-28
Code
22
QD-DETR (only Video w/ PT ASR Captions)
63.2
No
Query-Dependent Video Representation for Moment ...
2023-03-24
Code
23
QD-DETR (w/ audio)
63.06
No
Query-Dependent Video Representation for Moment ...
2023-03-24
Code
24
BAM-DETR
62.71
No
BAM-DETR: Boundary-Aligned Moment Detection Tran...
2023-11-30
Code
25
QD-DETR (only Video)
62.4
No
Query-Dependent Video Representation for Moment ...
2023-03-24
Code
26
BM-DETR
60.12
No
Background-aware Moment Detection for Video Mome...
2023-06-05
Code
27
Moment-DETR (w/ PT ASR Cpations)
59.78
No
QVHighlights: Detecting Moments and Highlights i...
2021-07-20
Code
28
DenoiseLoc
59.27
No
Boundary-Denoising for Video Activity Localization
2023-04-06
Code
29
UniVTG
58.86
No
UniVTG: Towards Unified Video-Language Temporal ...
2023-07-31
Code
30
SeViLA-Localizer
54.5
No
-
-
-
#1
LLaVA-MR
SOTA
76.59
R@1 IoU=0.5
· 2024-11-21
LLaVA-MR: Large Language-and-Vision Assistant for Video Moment Retrieval
Code
#2
SG-DETR (w/ PT)
SOTA
74.2
R@1 IoU=0.5
· Extra Data
· 2024-10-02
Saliency-Guided DETR for Moment Retrieval and Highlight Detection
Code
#3
SG-DETR
72.2
R@1 IoU=0.5
· 2024-10-02
Saliency-Guided DETR for Moment Retrieval and Highlight Detection
Code
#4
InternVideo2-6B
SOTA
71.42
R@1 IoU=0.5
· Extra Data
· 2024-03-22
InternVideo2: Scaling Foundation Models for Multimodal Video Understanding
Code
#5
FlashVTG
70.69
R@1 IoU=0.5
· 2024-12-18
FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding
Code
#6
VideoLights-B-pt
70.36
R@1 IoU=0.5
· Extra Data
· 2024-12-02
VideoLights: Feature Refinement and Cross-Task Alignment Transformer for Joint Video Highlight Detection and Moment Retrieval
Code
#7
CG-DETR (w/ PT)
SOTA
68.48
R@1 IoU=0.5
· Extra Data
· 2023-11-15
Correlation-Guided Query-Dependency Calibration for Video Temporal Grounding
Code
#8
R^2-Tuning
68.03
R@1 IoU=0.5
· 2024-03-31
$R^2$-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding
Code
#9
LD-DETR
66.8
R@1 IoU=0.5
· 2025-01-18
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection
Code
#10
LLMEPET
66.73
R@1 IoU=0.5
· 2024-07-21
Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
Code
#11
video-mamba-suite
66.65
R@1 IoU=0.5
· 2024-03-14
Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding
Code
#12
UnLoc-L
SOTA
66.1
R@1 IoU=0.5
· 2023-08-21
UnLoc: A Unified Framework for Video Localization Tasks
Code
#13
UniVTG (w/ PT)
SOTA
65.43
R@1 IoU=0.5
· Extra Data
· 2023-07-31
UniVTG: Towards Unified Video-Language Temporal Grounding
Code
#14
CG-DETR
65.43
R@1 IoU=0.5
· 2023-11-15
Correlation-Guided Query-Dependency Calibration for Video Temporal Grounding
Code
#15
UVCOM (w/ PT ASR Captions)
64.53
R@1 IoU=0.5
· Extra Data
· 2023-11-28
Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection
Code
#16
UnLoc-B
64.5
R@1 IoU=0.5
· 2023-08-21
UnLoc: A Unified Framework for Video Localization Tasks
Code
#17
QD-DETR (w/ PT)
SOTA
64.1
R@1 IoU=0.5
· 2023-03-24
Query-Dependent Video Representation for Moment Retrieval and Highlight Detection
Code
#18
BAM-DETR (w/ audio)
64.07
R@1 IoU=0.5
· 2023-11-30
BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos
Code
#19
LA-DETR
63.94
R@1 IoU=0.5
· 2024-12-30
Length-Aware DETR for Robust Moment Retrieval
Code
#20
BAM-DETR (w/ PT ASR Captions)
63.88
R@1 IoU=0.5
· Extra Data
· 2023-11-30
BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos
Code
#21
UVCOM
63.55
R@1 IoU=0.5
· 2023-11-28
Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection
Code
#22
QD-DETR (only Video w/ PT ASR Captions)
63.2
R@1 IoU=0.5
· 2023-03-24
Query-Dependent Video Representation for Moment Retrieval and Highlight Detection
Code
#23
QD-DETR (w/ audio)
63.06
R@1 IoU=0.5
· 2023-03-24
Query-Dependent Video Representation for Moment Retrieval and Highlight Detection
Code
#24
BAM-DETR
62.71
R@1 IoU=0.5
· 2023-11-30
BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos
Code
#25
QD-DETR (only Video)
62.4
R@1 IoU=0.5
· 2023-03-24
Query-Dependent Video Representation for Moment Retrieval and Highlight Detection
Code
#26
BM-DETR
60.12
R@1 IoU=0.5
· 2023-06-05
Background-aware Moment Detection for Video Moment Retrieval
Code
#27
Moment-DETR (w/ PT ASR Cpations)
SOTA
59.78
R@1 IoU=0.5
· 2021-07-20
QVHighlights: Detecting Moments and Highlights in Videos via Natural Language Queries
Code
#28
DenoiseLoc
59.27
R@1 IoU=0.5
· 2023-04-06
Boundary-Denoising for Video Activity Localization
Code
#29
UniVTG
58.86
R@1 IoU=0.5
· 2023-07-31
UniVTG: Towards Unified Video-Language Temporal Grounding
Code
#30
SeViLA-Localizer
54.5
R@1 IoU=0.5
No paper