Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Robots
/
Activity Recognition
/
HMDB51 (finetuned)
Activity Recognition on HMDB51 (finetuned)
Metric: Top-1 Accuracy (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Export CSV
Sort:
Top-1 Accuracy (best first)
Top-1 Accuracy (worst first)
Date (newest first)
Date (oldest first)
Model name (A→Z)
#
Model
↕
Top-1 Accuracy
▼
Extra Data
Paper
Date
↕
Code
1
BraVe:V-FA (TSM-50x2)
77.8
No
Broaden Your Views for Self-Supervised Video Lea...
2021-03-30
Code
2
MMV
70.1
No
Self-Supervised MultiModal Versatile Networks
2020-06-29
Code
3
CVRL (R3D-152 2x; K600)
69.9
No
Spatiotemporal Contrastive Video Representation ...
2020-08-09
Code
4
XDC
68.9
No
Self-Supervised Learning by Cross-Modal Audio-Vi...
2019-11-28
Code
5
CVRL (R3D-50; K600)
68
No
Spatiotemporal Contrastive Video Representation ...
2020-08-09
Code
6
ELo
67.4
No
Evolving Losses for Unsupervised Video Represent...
2020-02-26
-
7
CVRL (R3D-50; K400)
66.7
No
Spatiotemporal Contrastive Video Representation ...
2020-08-09
Code
8
AVID
64.7
No
Audio-Visual Instance Discrimination with Cross-...
2020-04-27
Code
9
ViCC (S3D; R+F)
62.2
No
Self-supervised Video Representation Learning wi...
2021-06-18
Code
10
AVTS
61.6
No
Cooperative Learning of Audio and Video Models f...
2018-06-30
-
11
ViCC (R2+1D; R+F)
61.5
No
Self-supervised Video Representation Learning wi...
2021-06-18
Code
12
CoCLR
54.6
No
Self-supervised Co-training for Video Representa...
2020-10-19
Code
13
ViCC (R2+1D; RGB)
52.4
No
Self-supervised Video Representation Learning wi...
2021-06-18
Code
14
ViCC (S3D; RGB))
47.9
No
Self-supervised Video Representation Learning wi...
2021-06-18
Code
#1
BraVe:V-FA (TSM-50x2)
SOTA
77.8
Top-1 Accuracy
· 2021-03-30
Broaden Your Views for Self-Supervised Video Learning
Code
#2
MMV
SOTA
70.1
Top-1 Accuracy
· 2020-06-29
Self-Supervised MultiModal Versatile Networks
Code
#3
CVRL (R3D-152 2x; K600)
69.9
Top-1 Accuracy
· 2020-08-09
Spatiotemporal Contrastive Video Representation Learning
Code
#4
XDC
SOTA
68.9
Top-1 Accuracy
· 2019-11-28
Self-Supervised Learning by Cross-Modal Audio-Video Clustering
Code
#5
CVRL (R3D-50; K600)
68
Top-1 Accuracy
· 2020-08-09
Spatiotemporal Contrastive Video Representation Learning
Code
#6
ELo
67.4
Top-1 Accuracy
· 2020-02-26
Evolving Losses for Unsupervised Video Representation Learning
#7
CVRL (R3D-50; K400)
66.7
Top-1 Accuracy
· 2020-08-09
Spatiotemporal Contrastive Video Representation Learning
Code
#8
AVID
64.7
Top-1 Accuracy
· 2020-04-27
Audio-Visual Instance Discrimination with Cross-Modal Agreement
Code
#9
ViCC (S3D; R+F)
62.2
Top-1 Accuracy
· 2021-06-18
Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting
Code
#10
AVTS
SOTA
61.6
Top-1 Accuracy
· 2018-06-30
Cooperative Learning of Audio and Video Models from Self-Supervised Synchronization
#11
ViCC (R2+1D; R+F)
61.5
Top-1 Accuracy
· 2021-06-18
Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting
Code
#12
CoCLR
54.6
Top-1 Accuracy
· 2020-10-19
Self-supervised Co-training for Video Representation Learning
Code
#13
ViCC (R2+1D; RGB)
52.4
Top-1 Accuracy
· 2021-06-18
Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting
Code
#14
ViCC (S3D; RGB))
47.9
Top-1 Accuracy
· 2021-06-18
Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting
Code