TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Learning Skeletal Graph Neural Networks for Hard 3D Pose E...

Learning Skeletal Graph Neural Networks for Hard 3D Pose Estimation

Ailing Zeng, Xiao Sun, Lei Yang, Nanxuan Zhao, Minhao Liu, Qiang Xu

2021-08-16ICCV 2021 103D Human Pose EstimationSkeleton Based Action RecognitionPose EstimationAction Recognition3D Pose Estimationgraph construction
PaperPDF

Abstract

Various deep learning techniques have been proposed to solve the single-view 2D-to-3D pose estimation problem. While the average prediction accuracy has been improved significantly over the years, the performance on hard poses with depth ambiguity, self-occlusion, and complex or rare poses is still far from satisfactory. In this work, we target these hard poses and present a novel skeletal GNN learning solution. To be specific, we propose a hop-aware hierarchical channel-squeezing fusion layer to effectively extract relevant information from neighboring nodes while suppressing undesired noises in GNN learning. In addition, we propose a temporal-aware dynamic graph construction procedure that is robust and effective for 3D pose estimation. Experimental results on the Human3.6M dataset show that our solution achieves 10.3\% average prediction accuracy improvement and greatly improves on hard poses over state-of-the-art techniques. We further apply the proposed technique on the skeleton-based action recognition task and also achieve state-of-the-art performance. Our code is available at https://github.com/ailingzengzzz/Skeletal-GNN.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationMPI-INF-3DHPAUC46.2Skeletal GNN
3D Human Pose EstimationMPI-INF-3DHPPCK82.1Skeletal GNN
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)47.9Skeletal GNN
VideoNTU RGB+D 120Accuracy (Cross-Setup)89.2Skeletal GNN
VideoNTU RGB+D 120Accuracy (Cross-Subject)87.5Skeletal GNN
VideoNTU RGB+D 120Ensembled Modalities4Skeletal GNN
VideoNTU RGB+DAccuracy (CS)91.6Skeletal GNN
VideoNTU RGB+DAccuracy (CV)96.7Skeletal GNN
VideoNTU RGB+DEnsembled Modalities4Skeletal GNN
Temporal Action LocalizationNTU RGB+D 120Accuracy (Cross-Setup)89.2Skeletal GNN
Temporal Action LocalizationNTU RGB+D 120Accuracy (Cross-Subject)87.5Skeletal GNN
Temporal Action LocalizationNTU RGB+D 120Ensembled Modalities4Skeletal GNN
Temporal Action LocalizationNTU RGB+DAccuracy (CS)91.6Skeletal GNN
Temporal Action LocalizationNTU RGB+DAccuracy (CV)96.7Skeletal GNN
Temporal Action LocalizationNTU RGB+DEnsembled Modalities4Skeletal GNN
Zero-Shot LearningNTU RGB+D 120Accuracy (Cross-Setup)89.2Skeletal GNN
Zero-Shot LearningNTU RGB+D 120Accuracy (Cross-Subject)87.5Skeletal GNN
Zero-Shot LearningNTU RGB+D 120Ensembled Modalities4Skeletal GNN
Zero-Shot LearningNTU RGB+DAccuracy (CS)91.6Skeletal GNN
Zero-Shot LearningNTU RGB+DAccuracy (CV)96.7Skeletal GNN
Zero-Shot LearningNTU RGB+DEnsembled Modalities4Skeletal GNN
Activity RecognitionNTU RGB+D 120Accuracy (Cross-Setup)89.2Skeletal GNN
Activity RecognitionNTU RGB+D 120Accuracy (Cross-Subject)87.5Skeletal GNN
Activity RecognitionNTU RGB+D 120Ensembled Modalities4Skeletal GNN
Activity RecognitionNTU RGB+DAccuracy (CS)91.6Skeletal GNN
Activity RecognitionNTU RGB+DAccuracy (CV)96.7Skeletal GNN
Activity RecognitionNTU RGB+DEnsembled Modalities4Skeletal GNN
Action LocalizationNTU RGB+D 120Accuracy (Cross-Setup)89.2Skeletal GNN
Action LocalizationNTU RGB+D 120Accuracy (Cross-Subject)87.5Skeletal GNN
Action LocalizationNTU RGB+D 120Ensembled Modalities4Skeletal GNN
Action LocalizationNTU RGB+DAccuracy (CS)91.6Skeletal GNN
Action LocalizationNTU RGB+DAccuracy (CV)96.7Skeletal GNN
Action LocalizationNTU RGB+DEnsembled Modalities4Skeletal GNN
Pose EstimationMPI-INF-3DHPAUC46.2Skeletal GNN
Pose EstimationMPI-INF-3DHPPCK82.1Skeletal GNN
Pose EstimationHuman3.6MAverage MPJPE (mm)47.9Skeletal GNN
Action DetectionNTU RGB+D 120Accuracy (Cross-Setup)89.2Skeletal GNN
Action DetectionNTU RGB+D 120Accuracy (Cross-Subject)87.5Skeletal GNN
Action DetectionNTU RGB+D 120Ensembled Modalities4Skeletal GNN
Action DetectionNTU RGB+DAccuracy (CS)91.6Skeletal GNN
Action DetectionNTU RGB+DAccuracy (CV)96.7Skeletal GNN
Action DetectionNTU RGB+DEnsembled Modalities4Skeletal GNN
3D Action RecognitionNTU RGB+D 120Accuracy (Cross-Setup)89.2Skeletal GNN
3D Action RecognitionNTU RGB+D 120Accuracy (Cross-Subject)87.5Skeletal GNN
3D Action RecognitionNTU RGB+D 120Ensembled Modalities4Skeletal GNN
3D Action RecognitionNTU RGB+DAccuracy (CS)91.6Skeletal GNN
3D Action RecognitionNTU RGB+DAccuracy (CV)96.7Skeletal GNN
3D Action RecognitionNTU RGB+DEnsembled Modalities4Skeletal GNN
3DMPI-INF-3DHPAUC46.2Skeletal GNN
3DMPI-INF-3DHPPCK82.1Skeletal GNN
3DHuman3.6MAverage MPJPE (mm)47.9Skeletal GNN
Action RecognitionNTU RGB+D 120Accuracy (Cross-Setup)89.2Skeletal GNN
Action RecognitionNTU RGB+D 120Accuracy (Cross-Subject)87.5Skeletal GNN
Action RecognitionNTU RGB+D 120Ensembled Modalities4Skeletal GNN
Action RecognitionNTU RGB+DAccuracy (CS)91.6Skeletal GNN
Action RecognitionNTU RGB+DAccuracy (CV)96.7Skeletal GNN
Action RecognitionNTU RGB+DEnsembled Modalities4Skeletal GNN
1 Image, 2*2 StitchiMPI-INF-3DHPAUC46.2Skeletal GNN
1 Image, 2*2 StitchiMPI-INF-3DHPPCK82.1Skeletal GNN
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)47.9Skeletal GNN

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17Efficiently Constructing Sparse Navigable Graphs2025-07-17NGTM: Substructure-based Neural Graph Topic Model for Interpretable Graph Generation2025-07-17