TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Pose And Joint-Aware Action Recognition

Pose And Joint-Aware Action Recognition

Anshul Shah, Shlok Mishra, Ankan Bansal, Jun-Cheng Chen, Rama Chellappa, Abhinav Shrivastava

2020-10-16Action ClassificationOptical Flow EstimationSkeleton Based Action RecognitionData AugmentationAction RecognitionAction Recognition In VideosTemporal Action LocalizationActivity Recognition
PaperPDFCode(official)

Abstract

Recent progress on action recognition has mainly focused on RGB and optical flow features. In this paper, we approach the problem of joint-based action recognition. Unlike other modalities, constellation of joints and their motion generate models with succinct human motion information for activity recognition. We present a new model for joint-based action recognition, which first extracts motion features from each joint separately through a shared motion encoder before performing collective reasoning. Our joint selector module re-weights the joint information to select the most discriminative joints for the task. We also propose a novel joint-contrastive loss that pulls together groups of joint features which convey the same action. We strengthen the joint-based representations by using a geometry-aware data augmentation technique which jitters pose heatmaps while retaining the dynamics of the action. We show large improvements over the current state-of-the-art joint-based approaches on JHMDB, HMDB, Charades, AVA action recognition datasets. A late fusion with RGB and Flow-based approaches yields additional improvements. Our model also outperforms the existing baseline on Mimetics, a dataset with out-of-context actions.

Results

TaskDatasetMetricValueModel
VideoJHMDB (2D poses only)Average accuracy of 3 splits68.55JMRN (No GT pose)
VideoCharadesMAP43.23JMRN + R101-NL-LFB
VideoCharadesMAP16.2JMRN (Pose only)
Temporal Action LocalizationJHMDB (2D poses only)Average accuracy of 3 splits68.55JMRN (No GT pose)
Zero-Shot LearningJHMDB (2D poses only)Average accuracy of 3 splits68.55JMRN (No GT pose)
Activity RecognitionHMDB-51Average accuracy of 3 splits84.53Ours + ResNext101 BERT
Activity RecognitionHMDB-51Average accuracy of 3 splits54.2JRMN
Activity RecognitionAVA v2.1mAP (Val)28.4JMRN + SlowFast-R101-NL
Activity RecognitionMimeticsmAP40JMRN
Activity RecognitionMimeticsmAP38.3SIP-Net
Activity RecognitionJHMDB (2D poses only)Average accuracy of 3 splits68.55JMRN (No GT pose)
Action LocalizationJHMDB (2D poses only)Average accuracy of 3 splits68.55JMRN (No GT pose)
Action DetectionJHMDB (2D poses only)Average accuracy of 3 splits68.55JMRN (No GT pose)
3D Action RecognitionJHMDB (2D poses only)Average accuracy of 3 splits68.55JMRN (No GT pose)
Action RecognitionHMDB-51Average accuracy of 3 splits84.53Ours + ResNext101 BERT
Action RecognitionHMDB-51Average accuracy of 3 splits54.2JRMN
Action RecognitionAVA v2.1mAP (Val)28.4JMRN + SlowFast-R101-NL
Action RecognitionMimeticsmAP40JMRN
Action RecognitionMimeticsmAP38.3SIP-Net
Action RecognitionJHMDB (2D poses only)Average accuracy of 3 splits68.55JMRN (No GT pose)

Related Papers

Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition2025-07-16Data Augmentation in Time Series Forecasting through Inverted Framework2025-07-15ZKP-FedEval: Verifiable and Privacy-Preserving Federated Evaluation using Zero-Knowledge Proofs2025-07-15