Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Computer Vision
/
Action Localization
/
THUMOS’14
Action Localization on THUMOS’14
Metric: mAP IOU@0.5 (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Hide extra data
Export CSV
#
Model
↕
mAP IOU@0.5
▼
Extra Data
Paper
Date
↕
Code
1
AdaTAD (VideoMAEv2-giant)
80.9
No
End-to-End Temporal Action Detection with 1B Par...
2023-11-28
Code
2
RDFA-S6 (InternVideo2-6B)
78.2
No
Enhancing Temporal Action Localization: Advanced...
2024-07-18
Code
3
ActionMamba(InternVideo2-6B)
76.9
No
Video Mamba Suite: State Space Model as a Versat...
2024-03-14
Code
4
TriDet (VideoMAE v2-g feature)
73.3
Yes
Temporal Action Localization with Enhanced Insta...
2023-09-11
Code
5
ActionFormer (VideoMAE V2-g features)
73
Yes
VideoMAE V2: Scaling Video Masked Autoencoders w...
2023-03-29
Code
6
TriDet (I3D features)
72.9
No
TriDet: Temporal Action Detection with Relative ...
2023-03-13
Code
7
TemporalMaxer (I3D features)
71.8
No
TemporalMaxer: Maximize Temporal Context with on...
2023-03-16
Code
8
ASL(I3D features)
71.7
No
Action Sensitivity Learning for Temporal Action ...
2023-05-25
-
9
ActionFormer (I3D features)
71
No
ActionFormer: Localizing Moments of Actions with...
2022-02-16
Code
10
DualDETR (I3D features)
70.4
No
Dual DETRs for Multi-Label Temporal Action Detec...
2024-03-31
-
11
BasicTAD (160,6,192,R50-SlowOnly)
63.5
No
BasicTAD: an Astounding RGB-Only Baseline for Te...
2022-05-05
Code
12
TadML(two-stream)
62.53
No
TadML: A fast temporal action detection with Mec...
2022-06-07
Code
13
TadTR
60.1
No
End-to-end Temporal Action Detection with Transf...
2021-06-18
Code
14
BasicTAD (112,3,96,R50-SlowOnly)
58.6
No
BasicTAD: an Astounding RGB-Only Baseline for Te...
2022-05-05
Code
15
ReAct (TSN features)
57.1
No
ReAct: Temporal Action Detection with Relational...
2022-07-14
Code
16
AVFusion
57.1
Yes
Hear Me Out: Fusional Approaches for Audio Augme...
2021-06-27
Code
17
TAGS (I3D)
57
No
Proposal-Free Temporal Action Detection via Glob...
2022-07-14
Code
18
MUSES
56.9
No
Multi-shot Temporal Event Localization: a Benchm...
2020-12-17
Code
19
TadML(rgb-only)
56.61
No
TadML: A fast temporal action detection with Mec...
2022-06-07
Code
20
E2E-TAD (SlowFast R50+TadTR)
56
No
An Empirical Study of End-to-End Temporal Action...
2022-04-06
Code
21
DCAN (TSN features)
54.1
No
DCAN: Improving Temporal Action Detection via Du...
2021-12-07
Code
22
DaoTAD
53.8
No
RGB Stream Is Enough for Temporal Action Detection
2021-07-09
Code
23
TSP
53.5
No
TSP: Temporally-Sensitive Pretraining of Video E...
2020-11-23
Code
24
VSGN
52.4
No
Video Self-Stitching Graph Network for Temporal ...
2020-11-30
Code
25
GCM
51.9
No
Graph Convolutional Module for Temporal Action L...
2021-12-01
-
26
AGT (Ours)
50.2
No
Activity Graph Transformer for Temporal Action L...
2021-01-21
-
27
P-GCN
49.1
No
Graph Convolutional Networks for Temporal Action...
2019-09-07
Code
28
Decouple-SSAD
44.2
No
Decoupling Localization and Classification in Si...
2019-04-16
Code
29
TAL-Net
42.8
No
Rethinking the Faster R-CNN Architecture for Tem...
2018-04-20
-
30
G-TAD
40.2
No
G-TAD: Sub-Graph Localization for Temporal Actio...
2019-11-26
Code
31
CO2-Net
38.3
No
Cross-modal Consensus Network for Weakly Supervi...
2021-07-27
Code
32
CO2-Net
38.3
No
Cross-modal Consensus Network for Weakly Supervi...
2021-07-27
Code
33
BSN UNet
36.9
No
BSN: Boundary Sensitive Network for Temporal Act...
2018-06-08
Code
34
ASM-Loc
36.6
No
ASM-Loc: Action-aware Segment Modeling for Weakl...
2022-03-29
Code
35
BMN
32.2
No
BMN: Boundary-Matching Network for Temporal Acti...
2019-07-23
Code
36
CBR-TS
31
No
Cascaded Boundary Regression for Temporal Action...
2017-05-02
-
37
A2CL-PT
30.1
No
Adversarial Background-Aware Loss for Weakly-sup...
2020-07-13
Code
38
DeepMetricLearner
29.6
No
Weakly Supervised Temporal Action Localization U...
2020-01-21
Code
39
R-C3D
28.9
No
R-C3D: Region Convolutional 3D Network for Tempo...
2017-03-22
Code
40
TURN-FL-16 + S-CNN
25.6
No
TURN TAP: Temporal Unit Regression Network for T...
2017-03-17
Code
41
CDC
23.3
No
CDC: Convolutional-De-Convolutional Networks for...
2017-03-04
Code
42
S-CNN
19
No
Temporal Action Localization in Untrimmed Videos...
2016-01-09
Code
43
Yeung et al.
17.1
No
End-to-end Learning of Action Detection from Fra...
2015-11-22
Code