TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Consensus-based Optimization for 3D Human Pose Estimation ...

Consensus-based Optimization for 3D Human Pose Estimation in Camera Coordinates

Diogo C. Luvizon, Hedi Tabia, David Picard

2019-11-213D Human Pose EstimationDepth PredictionPose EstimationDepth Estimation
PaperPDFCode(official)

Abstract

3D human pose estimation is frequently seen as the task of estimating 3D poses relative to the root body joint. Alternatively, we propose a 3D human pose estimation method in camera coordinates, which allows effective combination of 2D annotated data and 3D poses and a straightforward multi-view generalization. To that end, we cast the problem as a view frustum space pose estimation, where absolute depth prediction and joint relative depth estimations are disentangled. Final 3D predictions are obtained in camera coordinates by the inverse camera projection. Based on this, we also present a consensus-based optimization algorithm for multi-view predictions from uncalibrated images, which requires a single monocular training procedure. Although our method is indirectly tied to the training camera intrinsics, it still converges for cameras with different intrinsic parameters, resulting in coherent estimations up to a scale factor. Our method improves the state of the art on well known 3D human pose datasets, reducing the prediction error by 32% in the most common benchmark. We also reported our results in absolute pose position error, achieving 80~mm for monocular estimations and 51~mm for multi-view, on average.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationMPI-INF-3DHPAUC42.1Pose Consensus (monocular)
3D Human Pose EstimationMPI-INF-3DHPMPJPE112.1Pose Consensus (monocular)
3D Human Pose EstimationMPI-INF-3DHPPCK80.6Pose Consensus (monocular)
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)39Pose Consensus (multi-view, GT calib.)
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)45Pose Consensus (multi-view, est. calib.)
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)52Pose Consensus (monocular)
Pose EstimationMPI-INF-3DHPAUC42.1Pose Consensus (monocular)
Pose EstimationMPI-INF-3DHPMPJPE112.1Pose Consensus (monocular)
Pose EstimationMPI-INF-3DHPPCK80.6Pose Consensus (monocular)
Pose EstimationHuman3.6MAverage MPJPE (mm)39Pose Consensus (multi-view, GT calib.)
Pose EstimationHuman3.6MAverage MPJPE (mm)45Pose Consensus (multi-view, est. calib.)
Pose EstimationHuman3.6MAverage MPJPE (mm)52Pose Consensus (monocular)
3DMPI-INF-3DHPAUC42.1Pose Consensus (monocular)
3DMPI-INF-3DHPMPJPE112.1Pose Consensus (monocular)
3DMPI-INF-3DHPPCK80.6Pose Consensus (monocular)
3DHuman3.6MAverage MPJPE (mm)39Pose Consensus (multi-view, GT calib.)
3DHuman3.6MAverage MPJPE (mm)45Pose Consensus (multi-view, est. calib.)
3DHuman3.6MAverage MPJPE (mm)52Pose Consensus (monocular)
1 Image, 2*2 StitchiMPI-INF-3DHPAUC42.1Pose Consensus (monocular)
1 Image, 2*2 StitchiMPI-INF-3DHPMPJPE112.1Pose Consensus (monocular)
1 Image, 2*2 StitchiMPI-INF-3DHPPCK80.6Pose Consensus (monocular)
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)39Pose Consensus (multi-view, GT calib.)
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)45Pose Consensus (multi-view, est. calib.)
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)52Pose Consensus (monocular)

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16