TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Unifying Graph Embedding Features with Graph Convolutional...

Unifying Graph Embedding Features with Graph Convolutional Networks for Skeleton-based Action Recognition

Dong Yang, Monica Mengqi Li, Hong Fu, Jicong Fan, Zhao Zhang, Howard Leung

2020-03-06Skeleton Based Action RecognitionAction RecognitionTemporal Action LocalizationGraph Embedding
PaperPDF

Abstract

Combining skeleton structure with graph convolutional networks has achieved remarkable performance in human action recognition. Since current research focuses on designing basic graph for representing skeleton data, these embedding features contain basic topological information, which cannot learn more systematic perspectives from skeleton data. In this paper, we overcome this limitation by proposing a novel framework, which unifies 15 graph embedding features into the graph convolutional network for human action recognition, aiming to best take advantage of graph information to distinguish key joints, bones, and body parts in human action, instead of being exclusive to a single feature or domain. Additionally, we fully investigate how to find the best graph features of skeleton structure for improving human action recognition. Besides, the topological information of the skeleton sequence is explored to further enhance the performance in a multi-stream framework. Moreover, the unified graph features are extracted by the adaptive methods on the training process, which further yields improvements. Our model is validated by three large-scale datasets, namely NTU-RGB+D, Kinetics and SYSU-3D, and outperforms the state-of-the-art methods. Overall, our work unified graph embedding features to promotes systematic research on human action recognition.

Results

TaskDatasetMetricValueModel
VideoKinetics-Skeleton datasetAccuracy37.5CGCN
VideoNTU RGB+DAccuracy (CS)90.3CGCN
VideoNTU RGB+DAccuracy (CV)96.4CGCN
Temporal Action LocalizationKinetics-Skeleton datasetAccuracy37.5CGCN
Temporal Action LocalizationNTU RGB+DAccuracy (CS)90.3CGCN
Temporal Action LocalizationNTU RGB+DAccuracy (CV)96.4CGCN
Zero-Shot LearningKinetics-Skeleton datasetAccuracy37.5CGCN
Zero-Shot LearningNTU RGB+DAccuracy (CS)90.3CGCN
Zero-Shot LearningNTU RGB+DAccuracy (CV)96.4CGCN
Activity RecognitionKinetics-Skeleton datasetAccuracy37.5CGCN
Activity RecognitionNTU RGB+DAccuracy (CS)90.3CGCN
Activity RecognitionNTU RGB+DAccuracy (CV)96.4CGCN
Action LocalizationKinetics-Skeleton datasetAccuracy37.5CGCN
Action LocalizationNTU RGB+DAccuracy (CS)90.3CGCN
Action LocalizationNTU RGB+DAccuracy (CV)96.4CGCN
Action DetectionKinetics-Skeleton datasetAccuracy37.5CGCN
Action DetectionNTU RGB+DAccuracy (CS)90.3CGCN
Action DetectionNTU RGB+DAccuracy (CV)96.4CGCN
3D Action RecognitionKinetics-Skeleton datasetAccuracy37.5CGCN
3D Action RecognitionNTU RGB+DAccuracy (CS)90.3CGCN
3D Action RecognitionNTU RGB+DAccuracy (CV)96.4CGCN
Action RecognitionKinetics-Skeleton datasetAccuracy37.5CGCN
Action RecognitionNTU RGB+DAccuracy (CS)90.3CGCN
Action RecognitionNTU RGB+DAccuracy (CV)96.4CGCN

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17SMART: Relation-Aware Learning of Geometric Representations for Knowledge Graphs2025-07-17DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition2025-07-16Zero-shot Skeleton-based Action Recognition with Prototype-guided Feature Alignment2025-07-01EgoAdapt: Adaptive Multisensory Distillation and Policy Learning for Efficient Egocentric Perception2025-06-26Feature Hallucination for Self-supervised Action Recognition2025-06-25CARMA: Context-Aware Situational Grounding of Human-Robot Group Interactions by Combining Vision-Language Models with Object and Action Recognition2025-06-25Including Semantic Information via Word Embeddings for Skeleton-based Action Recognition2025-06-23