Video on THUMOS’14
Metric: mAP IOU@0.7 (higher is better)
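As this metric is commonly defined for temporal action detection, a predicted segment counts as correct only if its temporal IoU with a ground-truth segment of the same class is at least 0.7. The following is a minimal sketch of that overlap test, not the benchmark's official evaluation code. The function names and the `(start, end)` tuple layout in seconds are assumptions made for illustration. The rest of the mAP computation (ranking predictions by confidence, one-to-one matching, averaging over classes) is omitted.

```python
def temporal_iou(pred, gt):
    """Intersection over union of two 1-D segments given as (start, end)."""
    # Overlap length is zero when the segments do not intersect.
    inter = max(0.0, min(pred[1], gt[1]) - max(pred[0], gt[0]))
    union = (pred[1] - pred[0]) + (gt[1] - gt[0]) - inter
    return inter / union if union > 0 else 0.0


def is_true_positive(pred, gt, threshold=0.7):
    """Hypothetical helper: does the prediction meet the IoU threshold?"""
    return temporal_iou(pred, gt) >= threshold


if __name__ == "__main__":
    # Prediction covers 10 s to 20 s; ground truth starts 2 s later.
    print(is_true_positive((10.0, 20.0), (12.0, 20.0)))
    # Ground truth starts 4 s later, so the overlap is too small.
    print(is_true_positive((10.0, 20.0), (14.0, 20.0)))
```

At a threshold of 0.7, a prediction may miss only a small part of the ground-truth segment, which is why scores under this metric are lower than those reported at looser thresholds.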
Results
| # | Model | SOTA | mAP IOU@0.7 | Extra Data | Paper | Date | Code |
|---|-------|------|-------------|------------|-------|------|------|
| 1 | AdaTAD (VideoMAEv2-giant) | SOTA | 56.1 | No | End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames | 2023-11-28 | Code |
| 2 | RDFA-S6 (InternVideo2-6B) | | 51.9 | No | Enhancing Temporal Action Localization: Advanced S6 Modeling with Recurrent Mechanism | 2024-07-18 | Code |
| 3 | ActionMamba(InternVideo2-6B) | | 50.82 | No | Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding | 2024-03-14 | Code |
| 4 | TriDet (VideoMAE v2-g feature) | SOTA | 48.8 | Yes | Temporal Action Localization with Enhanced Instant Discriminability | 2023-09-11 | Code |
| 5 | ActionFormer (VideoMAE V2-g features) | SOTA | 47.7 | Yes | VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking | 2023-03-29 | Code |
| 6 | TriDet (I3D features) | SOTA | 47.4 | No | TriDet: Temporal Action Detection with Relative Boundary Modeling | 2023-03-13 | Code |
| 7 | ASL(I3D features) | | 45.8 | No | Action Sensitivity Learning for Temporal Action Localization | 2023-05-25 | - |
| 8 | TemporalMaxer (I3D features) | | 44.7 | No | TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization | 2023-03-16 | Code |
| 9 | DualDETR (I3D features) | | 44.4 | No | Dual DETRs for Multi-Label Temporal Action Detection | 2024-03-31 | - |
| 10 | ActionFormer (I3D features) | SOTA | 43.9 | No | ActionFormer: Localizing Moments of Actions with Transformers | 2022-02-16 | Code |
| 11 | TadML(two-stream) | | 39.6 | No | TadML: A fast temporal action detection with Mechanics-MLP | 2022-06-07 | Code |
| 12 | BasicTAD (160,6,192,R50-SlowOnly) | | 37.4 | No | BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection | 2022-05-05 | Code |
| 13 | ReAct (TSN features) | | 35.6 | No | ReAct: Temporal Action Detection with Relational Queries | 2022-07-14 | Code |
| 14 | E2E-TAD (SlowFast R50+TadTR) | | 34.9 | No | An Empirical Study of End-to-End Temporal Action Detection | 2022-04-06 | Code |
| 15 | BasicTAD (112,3,96,R50-SlowOnly) | | 33.5 | No | BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection | 2022-05-05 | Code |
| 16 | TadTR | SOTA | 32.8 | No | End-to-end Temporal Action Detection with Transformer | 2021-06-18 | Code |
| 17 | DCAN (TSN features) | | 32.6 | No | DCAN: Improving Temporal Action Detection via Dual Context Aggregation | 2021-12-07 | Code |
| 18 | TadML(rgb-only) | | 31.88 | No | TadML: A fast temporal action detection with Mechanics-MLP | 2022-06-07 | Code |
| 19 | TAGS (I3D) | | 31.8 | No | Proposal-Free Temporal Action Detection via Global Segmentation Mask Learning | 2022-07-14 | Code |
| 20 | MUSES | SOTA | 31 | No | Multi-shot Temporal Event Localization: a Benchmark | 2020-12-17 | Code |
| 21 | VSGN | SOTA | 30.4 | No | Video Self-Stitching Graph Network for Temporal Action Localization | 2020-11-30 | Code |
| 22 | DaoTAD | | 30.1 | No | RGB Stream Is Enough for Temporal Action Detection | 2021-07-09 | Code |
| 23 | AVFusion | | 28.8 | Yes | Hear Me Out: Fusional Approaches for Audio Augmented Temporal Action Localization | 2021-06-27 | Code |
| 24 | TSP | SOTA | 26 | No | TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks | 2020-11-23 | Code |
| 25 | TAL-Net | SOTA | 20.8 | No | Rethinking the Faster R-CNN Architecture for Temporal Action Localization | 2018-04-20 | - |
| 26 | BSN UNet | | 20 | No | BSN: Boundary Sensitive Network for Temporal Action Proposal Generation | 2018-06-08 | Code |
| 27 | Decouple-SSAD | | 19.1 | No | Decoupling Localization and Classification in Single Shot Temporal Action Detection | 2019-04-16 | Code |
| 28 | CO2-Net | | 13.4 | No | Cross-modal Consensus Network for Weakly Supervised Temporal Action Localization | 2021-07-27 | Code |
| 29 | ASM-Loc | | 13.4 | No | ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization | 2022-03-29 | Code |
| 30 | A2CL-PT | | 10.6 | No | Adversarial Background-Aware Loss for Weakly-supervised Temporal Activity Localization | 2020-07-13 | Code |
| 31 | CBR-TS | SOTA | 9.9 | No | Cascaded Boundary Regression for Temporal Action Detection | 2017-05-02 | - |
| 32 | DeepMetricLearner | | 9.7 | No | Weakly Supervised Temporal Action Localization Using Deep Metric Learning | 2020-01-21 | Code |
| 33 | CDC | SOTA | 7.9 | No | CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos | 2017-03-04 | Code |