Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Computer Vision
/
Video
/
HACS
Video on HACS
Metric: Average-mAP (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Export CSV
Sort:
Average-mAP (best first)
Average-mAP (worst first)
Date (newest first)
Date (oldest first)
Model name (A→Z)
#
Model
↕
Average-mAP
▼
Extra Data
Paper
Date
↕
Code
1
RDFA-S6 (InternVideo2-6B)
45.8
No
Enhancing Temporal Action Localization: Advanced...
2024-07-18
Code
2
ActionMamba(InternVideo2-6B)
44.56
No
Video Mamba Suite: State Space Model as a Versat...
2024-03-14
Code
3
DyFADet(VideoMAEv2)
44.3
No
DyFADet: Dynamic Feature Aggregation for Tempora...
2024-07-03
Code
4
InternVideo2-6B
43.3
No
InternVideo2: Scaling Foundation Models for Mult...
2024-03-22
Code
5
TriDet (VideoMAEv2)
43.1
No
Temporal Action Localization with Enhanced Insta...
2023-09-11
Code
6
InternVideo2-1B
42.4
No
InternVideo2: Scaling Foundation Models for Mult...
2024-03-22
Code
7
InternVideo
41.55
No
InternVideo: General Video Foundation Models via...
2022-12-06
Code
8
TriDet (SlowFast)
38.6
No
TriDet: Temporal Action Detection with Relative ...
2023-03-13
Code
9
TriDet (I3D RGB)
36.8
No
TriDet: Temporal Action Detection with Relative ...
2023-03-13
Code
10
TadTr (I3D RGB)
32.09
No
End-to-end Temporal Action Detection with Transf...
2021-06-18
Code
11
LoFi+G-TAD (RGB, RN18)
24.64
No
-
-
-
12
SSN
18.97
No
HACS: Human Action Clips and Segments Dataset fo...
2017-12-26
Code
#1
RDFA-S6 (InternVideo2-6B)
SOTA
45.8
Average-mAP
· 2024-07-18
Enhancing Temporal Action Localization: Advanced S6 Modeling with Recurrent Mechanism
Code
#2
ActionMamba(InternVideo2-6B)
SOTA
44.56
Average-mAP
· 2024-03-14
Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding
Code
#3
DyFADet(VideoMAEv2)
44.3
Average-mAP
· 2024-07-03
DyFADet: Dynamic Feature Aggregation for Temporal Action Detection
Code
#4
InternVideo2-6B
43.3
Average-mAP
· 2024-03-22
InternVideo2: Scaling Foundation Models for Multimodal Video Understanding
Code
#5
TriDet (VideoMAEv2)
SOTA
43.1
Average-mAP
· 2023-09-11
Temporal Action Localization with Enhanced Instant Discriminability
Code
#6
InternVideo2-1B
42.4
Average-mAP
· 2024-03-22
InternVideo2: Scaling Foundation Models for Multimodal Video Understanding
Code
#7
InternVideo
SOTA
41.55
Average-mAP
· 2022-12-06
InternVideo: General Video Foundation Models via Generative and Discriminative Learning
Code
#8
TriDet (SlowFast)
38.6
Average-mAP
· 2023-03-13
TriDet: Temporal Action Detection with Relative Boundary Modeling
Code
#9
TriDet (I3D RGB)
36.8
Average-mAP
· 2023-03-13
TriDet: Temporal Action Detection with Relative Boundary Modeling
Code
#10
TadTr (I3D RGB)
SOTA
32.09
Average-mAP
· 2021-06-18
End-to-end Temporal Action Detection with Transformer
Code
#11
LoFi+G-TAD (RGB, RN18)
24.64
Average-mAP
No paper
#12
SSN
SOTA
18.97
Average-mAP
· 2017-12-26
HACS: Human Action Clips and Segments Dataset for Recognition and Temporal Localization
Code