TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solu...

HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation

Jiefeng Li, Chao Xu, Zhicun Chen, Siyuan Bian, Lixin Yang, Cewu Lu

2020-11-30CVPR 2021 13D Human Pose EstimationPose Estimation3D human pose and shape estimationKeypoint Estimation
PaperPDFCodeCodeCode(official)Code

Abstract

Model-based 3D pose and shape estimation methods reconstruct a full 3D mesh for the human body by estimating several parameters. However, learning the abstract parameters is a highly non-linear process and suffers from image-model misalignment, leading to mediocre model performance. In contrast, 3D keypoint estimation methods combine deep CNN network with the volumetric representation to achieve pixel-level localization accuracy but may predict unrealistic body structure. In this paper, we address the above issues by bridging the gap between body mesh estimation and 3D keypoint estimation. We propose a novel hybrid inverse kinematics solution (HybrIK). HybrIK directly transforms accurate 3D joints to relative body-part rotations for 3D body mesh reconstruction, via the twist-and-swing decomposition. The swing rotation is analytically solved with 3D joints, and the twist rotation is derived from the visual cues through the neural network. We show that HybrIK preserves both the accuracy of 3D pose and the realistic body structure of the parametric human model, leading to a pixel-aligned 3D body mesh and a more accurate 3D pose than the pure 3D keypoint estimation methods. Without bells and whistles, the proposed method surpasses the state-of-the-art methods by a large margin on various 3D human pose and shape benchmarks. As an illustrative example, HybrIK outperforms all the previous methods by 13.2 mm MPJPE and 21.9 mm PVE on 3DPW dataset. Our code is available at https://github.com/Jeff-sjtu/HybrIK.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationEMDBAverage MPJAE (deg)24.5174HybrIK
3D Human Pose EstimationEMDBAverage MPJAE-PA (deg)23.0704HybrIK
3D Human Pose EstimationEMDBAverage MPJPE (mm)103.037HybrIK
3D Human Pose EstimationEMDBAverage MPJPE-PA (mm)65.5935HybrIK
3D Human Pose EstimationEMDBAverage MVE (mm)122.193HybrIK
3D Human Pose EstimationEMDBAverage MVE-PA (mm)80.3678HybrIK
3D Human Pose EstimationEMDBJitter (10m/s^3)49.2068HybrIK
3D Human Pose EstimationMPI-INF-3DHPAUC46.9HybrIK
3D Human Pose EstimationMPI-INF-3DHPMPJPE91HybrIK
3D Human Pose EstimationMPI-INF-3DHPPCK87.5HybrIK
3D Human Pose Estimation3DPWMPJPE74.1HybrIK
3D Human Pose Estimation3DPWMPVPE86.5HybrIK
3D Human Pose Estimation3DPWPA-MPJPE45HybrIK
Pose EstimationEMDBAverage MPJAE (deg)24.5174HybrIK
Pose EstimationEMDBAverage MPJAE-PA (deg)23.0704HybrIK
Pose EstimationEMDBAverage MPJPE (mm)103.037HybrIK
Pose EstimationEMDBAverage MPJPE-PA (mm)65.5935HybrIK
Pose EstimationEMDBAverage MVE (mm)122.193HybrIK
Pose EstimationEMDBAverage MVE-PA (mm)80.3678HybrIK
Pose EstimationEMDBJitter (10m/s^3)49.2068HybrIK
Pose EstimationMPI-INF-3DHPAUC46.9HybrIK
Pose EstimationMPI-INF-3DHPMPJPE91HybrIK
Pose EstimationMPI-INF-3DHPPCK87.5HybrIK
Pose Estimation3DPWMPJPE74.1HybrIK
Pose Estimation3DPWMPVPE86.5HybrIK
Pose Estimation3DPWPA-MPJPE45HybrIK
3DEMDBAverage MPJAE (deg)24.5174HybrIK
3DEMDBAverage MPJAE-PA (deg)23.0704HybrIK
3DEMDBAverage MPJPE (mm)103.037HybrIK
3DEMDBAverage MPJPE-PA (mm)65.5935HybrIK
3DEMDBAverage MVE (mm)122.193HybrIK
3DEMDBAverage MVE-PA (mm)80.3678HybrIK
3DEMDBJitter (10m/s^3)49.2068HybrIK
3DMPI-INF-3DHPAUC46.9HybrIK
3DMPI-INF-3DHPMPJPE91HybrIK
3DMPI-INF-3DHPPCK87.5HybrIK
3D3DPWMPJPE74.1HybrIK
3D3DPWMPVPE86.5HybrIK
3D3DPWPA-MPJPE45HybrIK
1 Image, 2*2 StitchiEMDBAverage MPJAE (deg)24.5174HybrIK
1 Image, 2*2 StitchiEMDBAverage MPJAE-PA (deg)23.0704HybrIK
1 Image, 2*2 StitchiEMDBAverage MPJPE (mm)103.037HybrIK
1 Image, 2*2 StitchiEMDBAverage MPJPE-PA (mm)65.5935HybrIK
1 Image, 2*2 StitchiEMDBAverage MVE (mm)122.193HybrIK
1 Image, 2*2 StitchiEMDBAverage MVE-PA (mm)80.3678HybrIK
1 Image, 2*2 StitchiEMDBJitter (10m/s^3)49.2068HybrIK
1 Image, 2*2 StitchiMPI-INF-3DHPAUC46.9HybrIK
1 Image, 2*2 StitchiMPI-INF-3DHPMPJPE91HybrIK
1 Image, 2*2 StitchiMPI-INF-3DHPPCK87.5HybrIK
1 Image, 2*2 Stitchi3DPWMPJPE74.1HybrIK
1 Image, 2*2 Stitchi3DPWMPVPE86.5HybrIK
1 Image, 2*2 Stitchi3DPWPA-MPJPE45HybrIK

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16