TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Robots/Activity Recognition/HMDB51

Activity Recognition on HMDB51

Metric: Top-1 Accuracy (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Top-1 Accuracy▼Extra DataPaperDate↕Code
1MVD (ViT-B)79.7NoMasked Video Distillation: Rethinking Masked Fea...2022-12-08Code
2M3Video78NoMasked Motion Encoding for Self-Supervised Video...2022-10-12Code
3pBYOL75NoA Large-Scale Study on Unsupervised Spatiotempor...2021-04-29Code
4SCE (R3D-50)74.7NoSimilarity Contrastive Estimation for Image and ...2022-12-21Code
5VideoMAE73.3NoVideoMAE: Masked Autoencoders are Data-Efficient...2022-03-23Code
6BraVe:V-FA (TSM-50x2)70.5NoBroaden Your Views for Self-Supervised Video Lea...2021-03-30Code
7CVRL (R3D-152 2x; K600)69.9NoSpatiotemporal Contrastive Video Representation ...2020-08-09Code
8XKD (ViT-B/112/16)69NoXKD: Cross-modal Knowledge Distillation with Dom...2022-11-25Code
9XDC68.9NoSelf-Supervised Learning by Cross-Modal Audio-Vi...2019-11-28Code
10CVRL (R3D-50; K600)68NoSpatiotemporal Contrastive Video Representation ...2020-08-09Code
11CrissCross (AudioSet)66.8NoSelf-Supervised Audio-Visual Representation Lear...2021-11-09Code
12CVRL (R3D-50; K400)66.7NoSpatiotemporal Contrastive Video Representation ...2020-08-09Code
13XDC66.5NoSelf-Supervised Learning by Cross-Modal Audio-Vi...2019-11-28Code
14XKD-Modality-Agnostic (ViT-B/112/16)65.9NoXKD: Cross-modal Knowledge Distillation with Dom...2022-11-25Code
15VideoMS (ViT-B)65.8NoEVEREST: Efficient Masked Video Autoencoder by R...2022-11-19Code
16AVID+CMA (Modified R2+1D-18 on Audioset)64.7NoAudio-Visual Instance Discrimination with Cross-...2020-04-27Code
17RSPNet64.7NoRSPNet: Relative Speed Perception for Unsupervis...2020-10-27Code
18CrissCross (Kinetics400)64.7NoSelf-Supervised Audio-Visual Representation Lear...2021-11-09Code
19ELo64.5NoEvolving Losses for Unsupervised Video Represent...2020-02-26-
20AVID (Modified R2+1D-18 on Audioset)64.1NoAudio-Visual Instance Discrimination with Cross-...2020-04-27Code
21XDC63.7NoSelf-Supervised Learning by Cross-Modal Audio-Vi...2019-11-28Code
22VideoMAE(no extra data)62.6NoVideoMAE: Masked Autoencoders are Data-Efficient...2022-03-23Code
23ViCC (S3D; R+F)62.2NoSelf-supervised Video Representation Learning wi...2021-06-18Code
24ViCC (R2+1D; R+F)61.5NoSelf-supervised Video Representation Learning wi...2021-06-18Code
25AVID+CMA (Modified R2+1D-18 on Kinetics)60.8NoAudio-Visual Instance Discrimination with Cross-...2020-04-27Code
26CrissCross (Kinetics-Sound)60.5NoSelf-Supervised Audio-Visual Representation Lear...2021-11-09Code
27AVID (Modified R2+1D-18 on Kinetics)59.9NoAudio-Visual Instance Discrimination with Cross-...2020-04-27Code
28MCN (R3D-18; RGB)54.8NoSelf-Supervised Video Representation Learning wi...2021-08-19-
29MCN (R2+1D; RGB)54.5NoSelf-Supervised Video Representation Learning wi...2021-08-19-
30SLIC (R3D-18)54.5NoSLIC: Self-Supervised Learning with Iterative Cl...2022-06-25Code
31TCLR (R3D-18)52.9NoTCLR: Temporal Contrastive Learning for Video Re...2021-01-20Code
32XDC52.6NoSelf-Supervised Learning by Cross-Modal Audio-Vi...2019-11-28Code
33ViCC (R2+1D; RGB)52.4NoSelf-supervised Video Representation Learning wi...2021-06-18Code
34CoCLR46.1NoSelf-supervised Co-training for Video Representa...2020-10-19Code
35PCL (ResNet-18)43.2NoPretext-Contrastive Learning: Toward Good Practi...2020-10-29Code
36ViCC (S3D; RGB)38.5NoSelf-supervised Video Representation Learning wi...2021-06-18Code
37IIC (R3D)38.3NoSelf-supervised Video Representation Learning Us...2020-08-06Code
38TCE (ResNet-50)36.6NoTemporally Coherent Embeddings for Self-Supervis...2020-03-21Code
39DPC (Modified 3D Resnet-34)35.7NoVideo Representation Learning by Dense Predictiv...2019-09-10Code
40DPC (Modified 3D ResNet-18)34.5NoVideo Representation Learning by Dense Predictiv...2019-09-10Code
41TCE (ResNet-18)34.2NoTemporally Coherent Embeddings for Self-Supervis...2020-03-21Code
423D RotNet (3D ResNet-18)33.7NoSelf-Supervised Spatiotemporal Feature Learning ...2018-11-28-
433D Cubic Puzzles (3D ResNet-18)33.7NoSelf-Supervised Video Representation Learning wi...2018-11-24-
44VCP (R3D)31.5NoVideo Cloze Procedure for Self-Supervised Spatio...2020-01-02Code
45Video Clip Ordering (R3D)29.5No---
46OPN (VGG-M-2048)23.8NoUnsupervised Representation Learning by Sorting ...2017-08-03Code
47Motion & Appearance (C3D)20.3NoSelf-supervised Spatio-temporal Representation L...2019-04-07Code
48Shuffle and Learn (AlexNet)19.8NoShuffle and Learn: Unsupervised Learning using T...2016-03-28-