TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Multi-hypothesis 3D human pose estimation metrics favor mi...

Multi-hypothesis 3D human pose estimation metrics favor miscalibrated distributions

Paweł A. Pierzchlewicz, R. James Cotton, Mohammad Bashiri, Fabian H. Sinz

2022-10-203D Human Pose EstimationDensity EstimationMulti-Hypotheses 3D Human Pose EstimationPose Estimation
PaperPDFCode(official)

Abstract

Due to depth ambiguities and occlusions, lifting 2D poses to 3D is a highly ill-posed problem. Well-calibrated distributions of possible poses can make these ambiguities explicit and preserve the resulting uncertainty for downstream tasks. This study shows that previous attempts, which account for these ambiguities via multiple hypotheses generation, produce miscalibrated distributions. We identify that miscalibration can be attributed to the use of sample-based metrics such as minMPJPE. In a series of simulations, we show that minimizing minMPJPE, as commonly done, should converge to the correct mean prediction. However, it fails to correctly capture the uncertainty, thus resulting in a miscalibrated distribution. To mitigate this problem, we propose an accurate and well-calibrated model called Conditional Graph Normalizing Flow (cGNFs). Our model is structured such that a single cGNF can estimate both conditional and marginal densities within the same model - effectively solving a zero-shot density estimation problem. We evaluate cGNF on the Human~3.6M dataset and show that cGNF provides a well-calibrated distribution estimate while being close to state-of-the-art in terms of overall minMPJPE. Furthermore, cGNF outperforms previous methods on occluded joints while it remains well-calibrated.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)48.5cGNF xlarge w Lsample
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)53cGNF w Lsample
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm) for occluded Joints41.8cGNF w Lsample
3D Human Pose EstimationHuman3.6MExpected Calibration Error0.08cGNF w Lsample
Pose EstimationHuman3.6MAverage MPJPE (mm)48.5cGNF xlarge w Lsample
Pose EstimationHuman3.6MAverage MPJPE (mm)53cGNF w Lsample
Pose EstimationHuman3.6MAverage MPJPE (mm) for occluded Joints41.8cGNF w Lsample
Pose EstimationHuman3.6MExpected Calibration Error0.08cGNF w Lsample
3DHuman3.6MAverage MPJPE (mm)48.5cGNF xlarge w Lsample
3DHuman3.6MAverage MPJPE (mm)53cGNF w Lsample
3DHuman3.6MAverage MPJPE (mm) for occluded Joints41.8cGNF w Lsample
3DHuman3.6MExpected Calibration Error0.08cGNF w Lsample
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)48.5cGNF xlarge w Lsample
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)53cGNF w Lsample
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm) for occluded Joints41.8cGNF w Lsample
1 Image, 2*2 StitchiHuman3.6MExpected Calibration Error0.08cGNF w Lsample

Related Papers

Missing value imputation with adversarial random forests -- MissARF2025-07-21$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16