TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/3D Human Pose and Shape Estimation via HybrIK-Transformer

3D Human Pose and Shape Estimation via HybrIK-Transformer

Boris N. Oreshkin

2023-02-093D Human Pose EstimationMonocular 3D Human Pose EstimationPose Estimation3D human pose and shape estimationDeep Learning3D Pose Estimation
PaperPDFCode(official)

Abstract

HybrIK relies on a combination of analytical inverse kinematics and deep learning to produce more accurate 3D pose estimation from 2D monocular images. HybrIK has three major components: (1) pretrained convolution backbone, (2) deconvolution to lift 3D pose from 2D convolution features, (3) analytical inverse kinematics pass correcting deep learning prediction using learned distribution of plausible twist and swing angles. In this paper we propose an enhancement of the 2D to 3D lifting module, replacing deconvolution with Transformer, resulting in accuracy and computational efficiency improvement relative to the original HybrIK method. We demonstrate our results on commonly used H36M, PW3D, COCO and HP3D datasets. Our code is publicly available https://github.com/boreshkinai/hybrik-transformer.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationMPI-INF-3DHPAUC48.9HybrIK-Transformer (HrNet-48)
3D Human Pose EstimationMPI-INF-3DHPMPJPE86.2HybrIK-Transformer (HrNet-48)
3D Human Pose EstimationMPI-INF-3DHPPCK88.6HybrIK-Transformer (HrNet-48)
3D Human Pose Estimation3DPWMPJPE71.6HybrIK-Transformer (HrNet-48)
3D Human Pose Estimation3DPWMPVPE83.6HybrIK-Transformer (HrNet-48)
3D Human Pose Estimation3DPWPA-MPJPE42.3HybrIK-Transformer (HrNet-48)
Pose EstimationMPI-INF-3DHPAUC48.9HybrIK-Transformer (HrNet-48)
Pose EstimationMPI-INF-3DHPMPJPE86.2HybrIK-Transformer (HrNet-48)
Pose EstimationMPI-INF-3DHPPCK88.6HybrIK-Transformer (HrNet-48)
Pose Estimation3DPWMPJPE71.6HybrIK-Transformer (HrNet-48)
Pose Estimation3DPWMPVPE83.6HybrIK-Transformer (HrNet-48)
Pose Estimation3DPWPA-MPJPE42.3HybrIK-Transformer (HrNet-48)
3DMPI-INF-3DHPAUC48.9HybrIK-Transformer (HrNet-48)
3DMPI-INF-3DHPMPJPE86.2HybrIK-Transformer (HrNet-48)
3DMPI-INF-3DHPPCK88.6HybrIK-Transformer (HrNet-48)
3D3DPWMPJPE71.6HybrIK-Transformer (HrNet-48)
3D3DPWMPVPE83.6HybrIK-Transformer (HrNet-48)
3D3DPWPA-MPJPE42.3HybrIK-Transformer (HrNet-48)
1 Image, 2*2 StitchiMPI-INF-3DHPAUC48.9HybrIK-Transformer (HrNet-48)
1 Image, 2*2 StitchiMPI-INF-3DHPMPJPE86.2HybrIK-Transformer (HrNet-48)
1 Image, 2*2 StitchiMPI-INF-3DHPPCK88.6HybrIK-Transformer (HrNet-48)
1 Image, 2*2 Stitchi3DPWMPJPE71.6HybrIK-Transformer (HrNet-48)
1 Image, 2*2 Stitchi3DPWMPVPE83.6HybrIK-Transformer (HrNet-48)
1 Image, 2*2 Stitchi3DPWPA-MPJPE42.3HybrIK-Transformer (HrNet-48)

Related Papers

Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16