TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Generating Multiple Hypotheses for 3D Human Pose Estimatio...

Generating Multiple Hypotheses for 3D Human Pose Estimation with Mixture Density Network

Chen Li, Gim Hee Lee

2019-04-11CVPR 2019 63D Human Pose EstimationMonocular 3D Human Pose EstimationMulti-Hypotheses 3D Human Pose EstimationPose Estimation
PaperPDFCode(official)Code

Abstract

3D human pose estimation from a monocular image or 2D joints is an ill-posed problem because of depth ambiguity and occluded joints. We argue that 3D human pose estimation from a monocular input is an inverse problem where multiple feasible solutions can exist. In this paper, we propose a novel approach to generate multiple feasible hypotheses of the 3D pose from 2D joints.In contrast to existing deep learning approaches which minimize a mean square error based on an unimodal Gaussian distribution, our method is able to generate multiple feasible hypotheses of 3D pose based on a multimodal mixture density networks. Our experiments show that the 3D poses estimated by our approach from an input of 2D joints are consistent in 2D reprojections, which supports our argument that multiple solutions exist for the 2D-to-3D inverse problem. Furthermore, we show state-of-the-art performance on the Human3.6M dataset in both best hypothesis and multi-view settings, and we demonstrate the generalization capacity of our model by testing on the MPII and MPI-INF-3DHP datasets. Our code is available at the project website.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationMPI-INF-3DHPPCK67.9MDM
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)49.6MDN (Multi-View)
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)52.7MDN
3D Human Pose EstimationHuman3.6MPA-MPJPE42.6MDN
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)52.7Multimodal Mixture Density Networks
3D Human Pose EstimationHuman3.6MFrames Needed1Multimodal Mixture Density Networks
3D Human Pose EstimationAH36MBest-Hypothesis MPJPE (n = 25)91.5SMPL-MDN (by 3D Multi-bodies)
3D Human Pose EstimationAH36MBest-Hypothesis PMPJPE (n = 25)69.5SMPL-MDN (by 3D Multi-bodies)
3D Human Pose EstimationAH36MH36M PMPJPE (n = 1)44.8SMPL-MDN (by 3D Multi-bodies)
3D Human Pose EstimationAH36MH36M PMPJPE (n = 25)42.7SMPL-MDN (by 3D Multi-bodies)
3D Human Pose EstimationAH36MMost-Likely Hypothesis PMPJPE (n = 1)74.7SMPL-MDN (by 3D Multi-bodies)
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)52.7MDN
3D Human Pose EstimationHuman3.6MAverage PMPJPE (mm)42.6MDN
Pose EstimationMPI-INF-3DHPPCK67.9MDM
Pose EstimationHuman3.6MAverage MPJPE (mm)49.6MDN (Multi-View)
Pose EstimationHuman3.6MAverage MPJPE (mm)52.7MDN
Pose EstimationHuman3.6MPA-MPJPE42.6MDN
Pose EstimationHuman3.6MAverage MPJPE (mm)52.7Multimodal Mixture Density Networks
Pose EstimationHuman3.6MFrames Needed1Multimodal Mixture Density Networks
Pose EstimationAH36MBest-Hypothesis MPJPE (n = 25)91.5SMPL-MDN (by 3D Multi-bodies)
Pose EstimationAH36MBest-Hypothesis PMPJPE (n = 25)69.5SMPL-MDN (by 3D Multi-bodies)
Pose EstimationAH36MH36M PMPJPE (n = 1)44.8SMPL-MDN (by 3D Multi-bodies)
Pose EstimationAH36MH36M PMPJPE (n = 25)42.7SMPL-MDN (by 3D Multi-bodies)
Pose EstimationAH36MMost-Likely Hypothesis PMPJPE (n = 1)74.7SMPL-MDN (by 3D Multi-bodies)
Pose EstimationHuman3.6MAverage MPJPE (mm)52.7MDN
Pose EstimationHuman3.6MAverage PMPJPE (mm)42.6MDN
3DMPI-INF-3DHPPCK67.9MDM
3DHuman3.6MAverage MPJPE (mm)49.6MDN (Multi-View)
3DHuman3.6MAverage MPJPE (mm)52.7MDN
3DHuman3.6MPA-MPJPE42.6MDN
3DHuman3.6MAverage MPJPE (mm)52.7Multimodal Mixture Density Networks
3DHuman3.6MFrames Needed1Multimodal Mixture Density Networks
3DAH36MBest-Hypothesis MPJPE (n = 25)91.5SMPL-MDN (by 3D Multi-bodies)
3DAH36MBest-Hypothesis PMPJPE (n = 25)69.5SMPL-MDN (by 3D Multi-bodies)
3DAH36MH36M PMPJPE (n = 1)44.8SMPL-MDN (by 3D Multi-bodies)
3DAH36MH36M PMPJPE (n = 25)42.7SMPL-MDN (by 3D Multi-bodies)
3DAH36MMost-Likely Hypothesis PMPJPE (n = 1)74.7SMPL-MDN (by 3D Multi-bodies)
3DHuman3.6MAverage MPJPE (mm)52.7MDN
3DHuman3.6MAverage PMPJPE (mm)42.6MDN
1 Image, 2*2 StitchiMPI-INF-3DHPPCK67.9MDM
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)49.6MDN (Multi-View)
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)52.7MDN
1 Image, 2*2 StitchiHuman3.6MPA-MPJPE42.6MDN
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)52.7Multimodal Mixture Density Networks
1 Image, 2*2 StitchiHuman3.6MFrames Needed1Multimodal Mixture Density Networks
1 Image, 2*2 StitchiAH36MBest-Hypothesis MPJPE (n = 25)91.5SMPL-MDN (by 3D Multi-bodies)
1 Image, 2*2 StitchiAH36MBest-Hypothesis PMPJPE (n = 25)69.5SMPL-MDN (by 3D Multi-bodies)
1 Image, 2*2 StitchiAH36MH36M PMPJPE (n = 1)44.8SMPL-MDN (by 3D Multi-bodies)
1 Image, 2*2 StitchiAH36MH36M PMPJPE (n = 25)42.7SMPL-MDN (by 3D Multi-bodies)
1 Image, 2*2 StitchiAH36MMost-Likely Hypothesis PMPJPE (n = 1)74.7SMPL-MDN (by 3D Multi-bodies)
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)52.7MDN
1 Image, 2*2 StitchiHuman3.6MAverage PMPJPE (mm)42.6MDN

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16