TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Ordinal Depth Supervision for 3D Human Pose Estimation

Ordinal Depth Supervision for 3D Human Pose Estimation

Georgios Pavlakos, Xiaowei Zhou, Kostas Daniilidis

2018-05-10CVPR 2018 63D Human Pose EstimationMonocular 3D Human Pose EstimationPose Estimation
PaperPDFCode

Abstract

Our ability to train end-to-end systems for 3D human pose estimation from single images is currently constrained by the limited availability of 3D annotations for natural images. Most datasets are captured using Motion Capture (MoCap) systems in a studio setting and it is difficult to reach the variability of 2D human pose datasets, like MPII or LSP. To alleviate the need for accurate 3D ground truth, we propose to use a weaker supervision signal provided by the ordinal depths of human joints. This information can be acquired by human annotators for a wide range of images and poses. We showcase the effectiveness and flexibility of training Convolutional Networks (ConvNets) with these ordinal relations in different settings, always achieving competitive performance with ConvNets trained with accurate 3D joint coordinates. Additionally, to demonstrate the potential of the approach, we augment the popular LSP and MPII datasets with ordinal depth annotations. This extension allows us to present quantitative and qualitative evaluation in non-studio conditions. Simultaneously, these ordinal annotations can be easily incorporated in the training procedure of typical ConvNets for 3D human pose. Through this inclusion we achieve new state-of-the-art performance for the relevant benchmarks and validate the effectiveness of ordinal depth supervision for 3D human pose.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationHumanEva-IMean Reconstruction Error (mm)18.3Ordinal Depth Supervision
3D Human Pose EstimationMPI-INF-3DHPAUC35.3Ordinal Depth Supervision
3D Human Pose EstimationMPI-INF-3DHPPCK71.9Ordinal Depth Supervision
3D Human Pose EstimationHuman3.6MFrames Needed1Ordinal Depth Supervision
Pose EstimationHumanEva-IMean Reconstruction Error (mm)18.3Ordinal Depth Supervision
Pose EstimationMPI-INF-3DHPAUC35.3Ordinal Depth Supervision
Pose EstimationMPI-INF-3DHPPCK71.9Ordinal Depth Supervision
Pose EstimationHuman3.6MFrames Needed1Ordinal Depth Supervision
3DHumanEva-IMean Reconstruction Error (mm)18.3Ordinal Depth Supervision
3DMPI-INF-3DHPAUC35.3Ordinal Depth Supervision
3DMPI-INF-3DHPPCK71.9Ordinal Depth Supervision
3DHuman3.6MFrames Needed1Ordinal Depth Supervision
1 Image, 2*2 StitchiHumanEva-IMean Reconstruction Error (mm)18.3Ordinal Depth Supervision
1 Image, 2*2 StitchiMPI-INF-3DHPAUC35.3Ordinal Depth Supervision
1 Image, 2*2 StitchiMPI-INF-3DHPPCK71.9Ordinal Depth Supervision
1 Image, 2*2 StitchiHuman3.6MFrames Needed1Ordinal Depth Supervision

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16