Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Computer Vision
/
Video
/
THUMOS’14
Video on THUMOS’14
Metric: mAP IOU@0.4 (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Hide extra data
Export CSV
Sort:
mAP IOU@0.4 (best first)
mAP IOU@0.4 (worst first)
Date (newest first)
Date (oldest first)
Model name (A→Z)
#
Model
↕
mAP IOU@0.4
▼
Extra Data
Paper
Date
↕
Code
1
AdaTAD (VideoMAEv2-giant)
86.7
No
End-to-End Temporal Action Detection with 1B Par...
2023-11-28
Code
2
RDFA-S6 (InternVideo2-6B)
84.6
No
Enhancing Temporal Action Localization: Advanced...
2024-07-18
Code
3
ActionMamba(InternVideo2-6B)
83.09
No
Video Mamba Suite: State Space Model as a Versat...
2024-03-14
Code
4
TriDet (I3D features)
80.1
No
TriDet: Temporal Action Detection with Relative ...
2023-03-13
Code
5
TriDet (VideoMAE v2-g feature)
80
Yes
Temporal Action Localization with Enhanced Insta...
2023-09-11
Code
6
ActionFormer (VideoMAE V2-g features)
79.6
Yes
VideoMAE V2: Scaling Video Masked Autoencoders w...
2023-03-29
Code
7
ASL(I3D features)
79
No
Action Sensitivity Learning for Temporal Action ...
2023-05-25
-
8
TemporalMaxer (I3D features)
78.9
No
TemporalMaxer: Maximize Temporal Context with on...
2023-03-16
Code
9
DualDETR (I3D features)
78
No
Dual DETRs for Multi-Label Temporal Action Detec...
2024-03-31
-
10
ActionFormer (I3D features)
77.8
No
ActionFormer: Localizing Moments of Actions with...
2022-02-16
Code
11
BasicTAD (160,6,192,R50-SlowOnly)
70.8
No
BasicTAD: an Astounding RGB-Only Baseline for Te...
2022-05-05
Code
12
TadML(two-stream)
69.73
No
TadML: A fast temporal action detection with Mec...
2022-06-07
Code
13
TadTR
69.1
No
End-to-end Temporal Action Detection with Transf...
2021-06-18
Code
14
ReAct (TSN features)
65
No
ReAct: Temporal Action Detection with Relational...
2022-07-14
Code
15
BasicTAD (112,3,96,R50-SlowOnly)
65
No
BasicTAD: an Astounding RGB-Only Baseline for Te...
2022-05-05
Code
16
AVFusion
64.9
Yes
Hear Me Out: Fusional Approaches for Audio Augme...
2021-06-27
Code
17
TadML(rgb-only)
64.66
No
TadML: A fast temporal action detection with Mec...
2022-06-07
Code
18
E2E-TAD (SlowFast R50+TadTR)
64.3
No
An Empirical Study of End-to-End Temporal Action...
2022-04-06
Code
19
MUSES
64
No
Multi-shot Temporal Event Localization: a Benchm...
2020-12-17
Code
20
TAGS (I3D)
63.8
No
Proposal-Free Temporal Action Detection via Glob...
2022-07-14
Code
21
TSP
63.3
No
TSP: Temporally-Sensitive Pretraining of Video E...
2020-11-23
Code
22
DCAN (TSN features)
62.7
No
DCAN: Improving Temporal Action Detection via Du...
2021-12-07
Code
23
GCM
60.8
No
Graph Convolutional Module for Temporal Action L...
2021-12-01
-
24
VSGN
60.4
No
Video Self-Stitching Graph Network for Temporal ...
2020-11-30
Code
25
DaoTAD
59.5
No
RGB Stream Is Enough for Temporal Action Detection
2021-07-09
Code
26
AGT (Ours)
58.1
No
Activity Graph Transformer for Temporal Action L...
2021-01-21
-
27
P-GCN
57.8
No
Graph Convolutional Networks for Temporal Action...
2019-09-07
Code
28
Decouple-SSAD
54.1
No
Decoupling Localization and Classification in Si...
2019-04-16
Code
29
TAL-Net
48.5
No
Rethinking the Faster R-CNN Architecture for Tem...
2018-04-20
-
30
ASM-Loc
46.8
No
ASM-Loc: Action-aware Segment Modeling for Weakl...
2022-03-29
Code
31
CO2-Net
45.7
No
Cross-modal Consensus Network for Weakly Supervi...
2021-07-27
Code
32
BSN UNet
45
No
BSN: Boundary Sensitive Network for Temporal Act...
2018-06-08
Code
33
CBR-TS
41.3
No
Cascaded Boundary Regression for Temporal Action...
2017-05-02
-
34
A2CL-PT
39
No
Adversarial Background-Aware Loss for Weakly-sup...
2020-07-13
Code
35
R-C3D
35.6
No
R-C3D: Region Convolutional 3D Network for Tempo...
2017-03-22
Code
36
TURN-FL-16 + S-CNN
34.9
No
TURN TAP: Temporal Unit Regression Network for T...
2017-03-17
Code
37
CDC
29.4
No
CDC: Convolutional-De-Convolutional Networks for...
2017-03-04
Code
38
S-CNN
28.7
No
Temporal Action Localization in Untrimmed Videos...
2016-01-09
Code
39
Yeung et al.
26.4
No
End-to-end Learning of Action Detection from Fra...
2015-11-22
Code
#1
AdaTAD (VideoMAEv2-giant)
SOTA
86.7
mAP IOU@0.4
· 2023-11-28
End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
Code
#2
RDFA-S6 (InternVideo2-6B)
84.6
mAP IOU@0.4
· 2024-07-18
Enhancing Temporal Action Localization: Advanced S6 Modeling with Recurrent Mechanism
Code
#3
ActionMamba(InternVideo2-6B)
83.09
mAP IOU@0.4
· 2024-03-14
Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding
Code
#4
TriDet (I3D features)
SOTA
80.1
mAP IOU@0.4
· 2023-03-13
TriDet: Temporal Action Detection with Relative Boundary Modeling
Code
#5
TriDet (VideoMAE v2-g feature)
80
mAP IOU@0.4
· Extra Data
· 2023-09-11
Temporal Action Localization with Enhanced Instant Discriminability
Code
#6
ActionFormer (VideoMAE V2-g features)
79.6
mAP IOU@0.4
· Extra Data
· 2023-03-29
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Code
#7
ASL(I3D features)
79
mAP IOU@0.4
· 2023-05-25
Action Sensitivity Learning for Temporal Action Localization
#8
TemporalMaxer (I3D features)
78.9
mAP IOU@0.4
· 2023-03-16
TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization
Code
#9
DualDETR (I3D features)
78
mAP IOU@0.4
· 2024-03-31
Dual DETRs for Multi-Label Temporal Action Detection
#10
ActionFormer (I3D features)
SOTA
77.8
mAP IOU@0.4
· 2022-02-16
ActionFormer: Localizing Moments of Actions with Transformers
Code
#11
BasicTAD (160,6,192,R50-SlowOnly)
70.8
mAP IOU@0.4
· 2022-05-05
BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection
Code
#12
TadML(two-stream)
69.73
mAP IOU@0.4
· 2022-06-07
TadML: A fast temporal action detection with Mechanics-MLP
Code
#13
TadTR
SOTA
69.1
mAP IOU@0.4
· 2021-06-18
End-to-end Temporal Action Detection with Transformer
Code
#14
ReAct (TSN features)
65
mAP IOU@0.4
· 2022-07-14
ReAct: Temporal Action Detection with Relational Queries
Code
#15
BasicTAD (112,3,96,R50-SlowOnly)
65
mAP IOU@0.4
· 2022-05-05
BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection
Code
#16
AVFusion
64.9
mAP IOU@0.4
· Extra Data
· 2021-06-27
Hear Me Out: Fusional Approaches for Audio Augmented Temporal Action Localization
Code
#17
TadML(rgb-only)
64.66
mAP IOU@0.4
· 2022-06-07
TadML: A fast temporal action detection with Mechanics-MLP
Code
#18
E2E-TAD (SlowFast R50+TadTR)
64.3
mAP IOU@0.4
· 2022-04-06
An Empirical Study of End-to-End Temporal Action Detection
Code
#19
MUSES
SOTA
64
mAP IOU@0.4
· 2020-12-17
Multi-shot Temporal Event Localization: a Benchmark
Code
#20
TAGS (I3D)
63.8
mAP IOU@0.4
· 2022-07-14
Proposal-Free Temporal Action Detection via Global Segmentation Mask Learning
Code
#21
TSP
SOTA
63.3
mAP IOU@0.4
· 2020-11-23
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks
Code
#22
DCAN (TSN features)
62.7
mAP IOU@0.4
· 2021-12-07
DCAN: Improving Temporal Action Detection via Dual Context Aggregation
Code
#23
GCM
60.8
mAP IOU@0.4
· 2021-12-01
Graph Convolutional Module for Temporal Action Localization in Videos
#24
VSGN
60.4
mAP IOU@0.4
· 2020-11-30
Video Self-Stitching Graph Network for Temporal Action Localization
Code
#25
DaoTAD
59.5
mAP IOU@0.4
· 2021-07-09
RGB Stream Is Enough for Temporal Action Detection
Code
#26
AGT (Ours)
58.1
mAP IOU@0.4
· 2021-01-21
Activity Graph Transformer for Temporal Action Localization
#27
P-GCN
SOTA
57.8
mAP IOU@0.4
· 2019-09-07
Graph Convolutional Networks for Temporal Action Localization
Code
#28
Decouple-SSAD
SOTA
54.1
mAP IOU@0.4
· 2019-04-16
Decoupling Localization and Classification in Single Shot Temporal Action Detection
Code
#29
TAL-Net
SOTA
48.5
mAP IOU@0.4
· 2018-04-20
Rethinking the Faster R-CNN Architecture for Temporal Action Localization
#30
ASM-Loc
46.8
mAP IOU@0.4
· 2022-03-29
ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization
Code
#31
CO2-Net
45.7
mAP IOU@0.4
· 2021-07-27
Cross-modal Consensus Network for Weakly Supervised Temporal Action Localization
Code
#32
BSN UNet
45
mAP IOU@0.4
· 2018-06-08
BSN: Boundary Sensitive Network for Temporal Action Proposal Generation
Code
#33
CBR-TS
SOTA
41.3
mAP IOU@0.4
· 2017-05-02
Cascaded Boundary Regression for Temporal Action Detection
#34
A2CL-PT
39
mAP IOU@0.4
· 2020-07-13
Adversarial Background-Aware Loss for Weakly-supervised Temporal Activity Localization
Code
#35
R-C3D
SOTA
35.6
mAP IOU@0.4
· 2017-03-22
R-C3D: Region Convolutional 3D Network for Temporal Activity Detection
Code
#36
TURN-FL-16 + S-CNN
SOTA
34.9
mAP IOU@0.4
· 2017-03-17
TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals
Code
#37
CDC
SOTA
29.4
mAP IOU@0.4
· 2017-03-04
CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos
Code
#38
S-CNN
SOTA
28.7
mAP IOU@0.4
· 2016-01-09
Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs
Code
#39
Yeung et al.
SOTA
26.4
mAP IOU@0.4
· 2015-11-22
End-to-end Learning of Action Detection from Frame Glimpses in Videos
Code