Jiefeng Li, Siyuan Bian, Chao Xu, Zhicun Chen, Lixin Yang, Cewu Lu
Recovering a whole-body mesh by inferring abstract pose and shape parameters from visual content yields 3D bodies with realistic structures. However, the inference process is highly non-linear and suffers from image-mesh misalignment, resulting in inaccurate reconstruction. In contrast, 3D keypoint estimation methods use a volumetric representation to achieve pixel-level accuracy but may predict unrealistic body structures. To address these issues, this paper presents a novel hybrid inverse kinematics solution, HybrIK, that integrates the merits of 3D keypoint estimation and body mesh recovery in a unified framework. HybrIK directly transforms accurate 3D joints into body-part rotations via twist-and-swing decomposition. The swing rotations are solved analytically from the 3D joints, while the twist rotations are derived from visual cues through neural networks. To capture comprehensive whole-body details, we further develop a holistic framework, HybrIK-X, which enhances HybrIK with articulated hands and an expressive face. HybrIK-X solves the whole-body pose with a one-stage model, making it fast and accurate. Experiments demonstrate that HybrIK and HybrIK-X preserve both the accuracy of 3D joints and the realistic structure of the parametric human model, leading to pixel-aligned whole-body mesh recovery. The proposed method significantly surpasses state-of-the-art methods on various benchmarks for body-only, hand-only, and whole-body scenarios. Code and results can be found at https://jeffli.site/HybrIK-X/
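
The twist-and-swing decomposition described above can be illustrated for a single bone. This is a minimal sketch under stated assumptions, not the authors' implementation: the names `t` (bone direction in the rest pose), `p` (bone direction taken from the predicted 3D joints), and `phi` (twist angle) are hypothetical, the twist angle is passed in as a plain number where the method would use a network prediction, and the composition order (swing applied after twist) is an assumed convention.

```python
import math

def cross(a, b):
    return [a[1]*b[2] - a[2]*b[1], a[2]*b[0] - a[0]*b[2], a[0]*b[1] - a[1]*b[0]]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def normalize(v):
    n = math.sqrt(dot(v, v))
    return [x / n for x in v]

def rodrigues(axis, angle):
    """Rotation matrix for `angle` radians about a unit-length `axis`."""
    x, y, z = axis
    c, s = math.cos(angle), math.sin(angle)
    k = 1.0 - c
    return [
        [c + x*x*k,   x*y*k - z*s, x*z*k + y*s],
        [y*x*k + z*s, c + y*y*k,   y*z*k - x*s],
        [z*x*k - y*s, z*y*k + x*s, c + z*z*k],
    ]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def matvec(a, v):
    return [dot(row, v) for row in a]

def swing_rotation(t, p):
    """Smallest rotation taking direction t onto direction p (closed form)."""
    t, p = normalize(t), normalize(p)
    axis = cross(t, p)
    sin_a = math.sqrt(dot(axis, axis))
    cos_a = dot(t, p)
    if sin_a < 1e-8:
        if cos_a > 0:
            # Directions already coincide: no swing needed.
            return [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
        # Opposite directions: rotate by pi about any axis perpendicular to t.
        helper = [1.0, 0.0, 0.0] if abs(t[0]) < 0.9 else [0.0, 1.0, 0.0]
        return rodrigues(normalize(cross(t, helper)), math.pi)
    return rodrigues([a / sin_a for a in axis], math.atan2(sin_a, cos_a))

def twist_rotation(t, phi):
    """Rotation by phi about the bone axis t; it leaves t itself unchanged."""
    return rodrigues(normalize(t), phi)

def compose_rotation(t, p, phi):
    """Assumed convention: twist about t first, then swing t onto p."""
    return matmul(swing_rotation(t, p), twist_rotation(t, phi))

# Hypothetical example values for one bone.
t = [0.0, 1.0, 0.0]   # rest-pose bone direction
p = [1.0, 0.0, 0.0]   # bone direction from the estimated 3D joints
R = compose_rotation(t, p, phi=0.7)
rotated = matvec(R, t)
```

Whatever value `phi` takes, `rotated` points along `p`. The bone direction is fixed entirely by the swing, which is why it can be solved from the 3D joints alone, while the twist has to come from another source such as image features.
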
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Reconstruction | AGORA | FB-MPJPE | 107.6 | HybrIK-X |
| Reconstruction | AGORA | FB-MVE | 112.1 | HybrIK-X |
| Reconstruction | AGORA | FB-NMJE | 115.7 | HybrIK-X |
| Reconstruction | AGORA | FB-NMVE | 120.5 | HybrIK-X |
| 3D Human Pose Estimation | AGORA | B-MPJPE | 67.2 | HybrIK-X |
| 3D Human Pose Estimation | AGORA | B-MVE | 68.5 | HybrIK-X |
| 3D Human Pose Estimation | AGORA | B-NMJE | 72.3 | HybrIK-X |
| 3D Human Pose Estimation | AGORA | B-NMVE | 73.7 | HybrIK-X |
| 3D Human Pose Estimation | AGORA | B-MPJPE | 77 | HybrIK |
| 3D Human Pose Estimation | AGORA | B-MVE | 73.9 | HybrIK |
| 3D Human Pose Estimation | AGORA | B-NMJE | 84.6 | HybrIK |
| 3D Human Pose Estimation | AGORA | B-NMVE | 81.2 | HybrIK |
| 3D Human Pose Estimation | MPI-INF-3DHP | AUC | 47.3 | HybrIK (HRNet-W48) |
| 3D Human Pose Estimation | MPI-INF-3DHP | MPJPE | 91 | HybrIK (HRNet-W48) |
| 3D Human Pose Estimation | MPI-INF-3DHP | PCK | 87.1 | HybrIK (HRNet-W48) |
| 3D Human Pose Estimation | 3DPW | MPJPE | 71.6 | HybrIK (HRNet-W48) |
| 3D Human Pose Estimation | 3DPW | MPVPE | 82.3 | HybrIK (HRNet-W48) |
| 3D Human Pose Estimation | 3DPW | PA-MPJPE | 41.8 | HybrIK (HRNet-W48) |