TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Learning Dynamical Human-Joint Affinity for 3D Pose Estima...

Learning Dynamical Human-Joint Affinity for 3D Pose Estimation in Videos

Junhao Zhang, Yali Wang, Zhipeng Zhou, Tianyu Luan, Zhe Wang, Yu Qiao

2021-09-153D Human Pose EstimationPose Estimation3D Pose Estimation
PaperPDF

Abstract

Graph Convolution Network (GCN) has been successfully used for 3D human pose estimation in videos. However, it is often built on the fixed human-joint affinity, according to human skeleton. This may reduce adaptation capacity of GCN to tackle complex spatio-temporal pose variations in videos. To alleviate this problem, we propose a novel Dynamical Graph Network (DG-Net), which can dynamically identify human-joint affinity, and estimate 3D pose by adaptively learning spatial/temporal joint relations from videos. Different from traditional graph convolution, we introduce Dynamical Spatial/Temporal Graph convolution (DSG/DTG) to discover spatial/temporal human-joint affinity for each video exemplar, depending on spatial distance/temporal movement similarity between human joints in this video. Hence, they can effectively understand which joints are spatially closer and/or have consistent motion, for reducing depth ambiguity and/or motion uncertainty when lifting 2D pose to 3D pose. We conduct extensive experiments on three popular benchmarks, e.g., Human3.6M, HumanEva-I, and MPI-INF-3DHP, where DG-Net outperforms a number of recent SOTA approaches with fewer input frames and model size.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationHumanEva-IMean Reconstruction Error (mm)19.5DG-Net (T=4)
3D Human Pose EstimationMPI-INF-3DHPAUC53.8DG-Net (T=4)
3D Human Pose EstimationMPI-INF-3DHPMPJPE76DG-Net (T=4)
Pose EstimationHumanEva-IMean Reconstruction Error (mm)19.5DG-Net (T=4)
Pose EstimationMPI-INF-3DHPAUC53.8DG-Net (T=4)
Pose EstimationMPI-INF-3DHPMPJPE76DG-Net (T=4)
3DHumanEva-IMean Reconstruction Error (mm)19.5DG-Net (T=4)
3DMPI-INF-3DHPAUC53.8DG-Net (T=4)
3DMPI-INF-3DHPMPJPE76DG-Net (T=4)
1 Image, 2*2 StitchiHumanEva-IMean Reconstruction Error (mm)19.5DG-Net (T=4)
1 Image, 2*2 StitchiMPI-INF-3DHPAUC53.8DG-Net (T=4)
1 Image, 2*2 StitchiMPI-INF-3DHPMPJPE76DG-Net (T=4)

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16