TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Investigation of Different Skeleton Features for CNN-based...

Investigation of Different Skeleton Features for CNN-based 3D Action Recognition

Zewei Ding, Pichao Wang, Philip O. Ogunbona, Wanqing Li

2017-05-023D Action RecognitionSkeleton Based Action RecognitionAction RecognitionTemporal Action Localization
PaperPDFCode

Abstract

Deep learning techniques are being used in skeleton based action recognition tasks and outstanding performance has been reported. Compared with RNN based methods which tend to overemphasize temporal information, CNN-based approaches can jointly capture spatio-temporal information from texture color images encoded from skeleton sequences. There are several skeleton-based features that have proven effective in RNN-based and handcrafted-feature-based methods. However, it remains unknown whether they are suitable for CNN-based approaches. This paper proposes to encode five spatial skeleton features into images with different encoding methods. In addition, the performance implication of different joints used for feature extraction is studied. The proposed method achieved state-of-the-art performance on NTU RGB+D dataset for 3D human action analysis. An accuracy of 75.32\% was achieved in Large Scale 3D Human Activity Analysis Challenge in Depth Videos.

Results

TaskDatasetMetricValueModel
VideoNTU RGB+DAccuracy (CV)82.31Five Spatial Skeleton Features
Temporal Action LocalizationNTU RGB+DAccuracy (CV)82.31Five Spatial Skeleton Features
Zero-Shot LearningNTU RGB+DAccuracy (CV)82.31Five Spatial Skeleton Features
Activity RecognitionNTU RGB+DAccuracy (CV)82.31Five Spatial Skeleton Features
Action LocalizationNTU RGB+DAccuracy (CV)82.31Five Spatial Skeleton Features
Action DetectionNTU RGB+DAccuracy (CV)82.31Five Spatial Skeleton Features
3D Action RecognitionNTU RGB+DAccuracy (CV)82.31Five Spatial Skeleton Features
Action RecognitionNTU RGB+DAccuracy (CV)82.31Five Spatial Skeleton Features

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition2025-07-16Zero-shot Skeleton-based Action Recognition with Prototype-guided Feature Alignment2025-07-01EgoAdapt: Adaptive Multisensory Distillation and Policy Learning for Efficient Egocentric Perception2025-06-26Feature Hallucination for Self-supervised Action Recognition2025-06-25CARMA: Context-Aware Situational Grounding of Human-Robot Group Interactions by Combining Vision-Language Models with Object and Action Recognition2025-06-25Including Semantic Information via Word Embeddings for Skeleton-based Action Recognition2025-06-23Adapting Vision-Language Models for Evaluating World Models2025-06-22