TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Optimized Skeleton-based Action Recognition via Sparsified...

Optimized Skeleton-based Action Recognition via Sparsified Graph Regression

Xiang Gao, Wei Hu, Jiaxiang Tang, Jiaying Liu, Zongming Guo

2018-11-29regressionSkeleton Based Action RecognitionGraph RegressionAction RecognitionTemporal Action Localizationgraph construction
PaperPDF

Abstract

With the prevalence of accessible depth sensors, dynamic human body skeletons have attracted much attention as a robust modality for action recognition. Previous methods model skeletons based on RNN or CNN, which has limited expressive power for irregular skeleton joints. While graph convolutional networks (GCN) have been proposed to address irregular graph-structured data, the fundamental graph construction remains challenging. In this paper, we represent skeletons naturally on graphs, and propose a graph regression based GCN (GR-GCN) for skeleton-based action recognition, aiming to capture the spatio-temporal variation in the data. As the graph representation is crucial to graph convolution, we first propose graph regression to statistically learn the underlying graph from multiple observations. In particular, we provide spatio-temporal modeling of skeletons and pose an optimization problem on the graph structure over consecutive frames, which enforces the sparsity of the underlying graph for efficient representation. The optimized graph not only connects each joint to its neighboring joints in the same frame strongly or weakly, but also links with relevant joints in the previous and subsequent frames. We then feed the optimized graph into the GCN along with the coordinates of the skeleton sequence for feature learning, where we deploy high-order and fast Chebyshev approximation of spectral graph convolution. Further, we provide analysis of the variation characterization by the Chebyshev approximation. Experimental results validate the effectiveness of the proposed graph regression and show that the proposed GR-GCN achieves the state-of-the-art performance on the widely used NTU RGB+D, UT-Kinect and SYSU 3D datasets.

Results

TaskDatasetMetricValueModel
VideoNTU RGB+DAccuracy (CS)87.5GR-GCN
VideoNTU RGB+DAccuracy (CV)94.3GR-GCN
Temporal Action LocalizationNTU RGB+DAccuracy (CS)87.5GR-GCN
Temporal Action LocalizationNTU RGB+DAccuracy (CV)94.3GR-GCN
Zero-Shot LearningNTU RGB+DAccuracy (CS)87.5GR-GCN
Zero-Shot LearningNTU RGB+DAccuracy (CV)94.3GR-GCN
Activity RecognitionNTU RGB+DAccuracy (CS)87.5GR-GCN
Activity RecognitionNTU RGB+DAccuracy (CV)94.3GR-GCN
Action LocalizationNTU RGB+DAccuracy (CS)87.5GR-GCN
Action LocalizationNTU RGB+DAccuracy (CV)94.3GR-GCN
Action DetectionNTU RGB+DAccuracy (CS)87.5GR-GCN
Action DetectionNTU RGB+DAccuracy (CV)94.3GR-GCN
3D Action RecognitionNTU RGB+DAccuracy (CS)87.5GR-GCN
3D Action RecognitionNTU RGB+DAccuracy (CV)94.3GR-GCN
Action RecognitionNTU RGB+DAccuracy (CS)87.5GR-GCN
Action RecognitionNTU RGB+DAccuracy (CV)94.3GR-GCN

Related Papers

Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression2025-07-20A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17Efficiently Constructing Sparse Navigable Graphs2025-07-17NGTM: Substructure-based Neural Graph Topic Model for Interpretable Graph Generation2025-07-17Neural Network-Guided Symbolic Regression for Interpretable Descriptor Discovery in Perovskite Catalysts2025-07-16Imbalanced Regression Pipeline Recommendation2025-07-16Second-Order Bounds for [0,1]-Valued Regression via Betting Loss2025-07-16DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition2025-07-16