TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/IKOL: Inverse kinematics optimization layer for 3D human p...

IKOL: Inverse kinematics optimization layer for 3D human pose and shape estimation via Gauss-Newton differentiation

Juze Zhang, Ye Shi, Yuexin Ma, Lan Xu, Jingyi Yu, Jingya Wang

2023-02-023D Human Pose Estimationregression3D human pose and shape estimation
PaperPDFCode(official)

Abstract

This paper presents an inverse kinematic optimization layer (IKOL) for 3D human pose and shape estimation that leverages the strength of both optimization- and regression-based methods within an end-to-end framework. IKOL involves a nonconvex optimization that establishes an implicit mapping from an image's 3D keypoints and body shapes to the relative body-part rotations. The 3D keypoints and the body shapes are the inputs and the relative body-part rotations are the solutions. However, this procedure is implicit and hard to make differentiable. So, to overcome this issue, we designed a Gauss-Newton differentiation (GN-Diff) procedure to differentiate IKOL. GN-Diff iteratively linearizes the nonconvex objective function to obtain Gauss-Newton directions with closed form solutions. Then, an automatic differentiation procedure is directly applied to generate a Jacobian matrix for end-to-end training. Notably, the GN-Diff procedure works fast because it does not rely on a time-consuming implicit differentiation procedure. The twist rotation and shape parameters are learned from the neural networks and, as a result, IKOL has a much lower computational overhead than most existing optimization-based methods. Additionally, compared to existing regression-based methods, IKOL provides a more accurate mesh-image correspondence. This is because it iteratively reduces the distance between the keypoints and also enhances the reliability of the pose structures. Extensive experiments demonstrate the superiority of our proposed framework over a wide range of 3D human pose and shape estimation methods.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationMPI-INF-3DHPAUC48.1IKOL
3D Human Pose EstimationMPI-INF-3DHPMPJPE88.8IKOL
3D Human Pose EstimationMPI-INF-3DHPPCK87.9IKOL
3D Human Pose Estimation3DPWMPJPE71.1IKOL
3D Human Pose Estimation3DPWMPVPE84.1IKOL
3D Human Pose Estimation3DPWPA-MPJPE44.5IKOL
Pose EstimationMPI-INF-3DHPAUC48.1IKOL
Pose EstimationMPI-INF-3DHPMPJPE88.8IKOL
Pose EstimationMPI-INF-3DHPPCK87.9IKOL
Pose Estimation3DPWMPJPE71.1IKOL
Pose Estimation3DPWMPVPE84.1IKOL
Pose Estimation3DPWPA-MPJPE44.5IKOL
3DMPI-INF-3DHPAUC48.1IKOL
3DMPI-INF-3DHPMPJPE88.8IKOL
3DMPI-INF-3DHPPCK87.9IKOL
3D3DPWMPJPE71.1IKOL
3D3DPWMPVPE84.1IKOL
3D3DPWPA-MPJPE44.5IKOL
1 Image, 2*2 StitchiMPI-INF-3DHPAUC48.1IKOL
1 Image, 2*2 StitchiMPI-INF-3DHPMPJPE88.8IKOL
1 Image, 2*2 StitchiMPI-INF-3DHPPCK87.9IKOL
1 Image, 2*2 Stitchi3DPWMPJPE71.1IKOL
1 Image, 2*2 Stitchi3DPWMPVPE84.1IKOL
1 Image, 2*2 Stitchi3DPWPA-MPJPE44.5IKOL

Related Papers

Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression2025-07-20Neural Network-Guided Symbolic Regression for Interpretable Descriptor Discovery in Perovskite Catalysts2025-07-16Imbalanced Regression Pipeline Recommendation2025-07-16Second-Order Bounds for [0,1]-Valued Regression via Betting Loss2025-07-16Sparse Regression Codes exploit Multi-User Diversity without CSI2025-07-15Bradley-Terry and Multi-Objective Reward Modeling Are Complementary2025-07-10Active Learning for Manifold Gaussian Process Regression2025-06-26A Survey of Predictive Maintenance Methods: An Analysis of Prognostics via Classification and Regression2025-06-25