TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Towards 3D Human Pose Estimation in the Wild: a Weakly-sup...

Towards 3D Human Pose Estimation in the Wild: a Weakly-supervised Approach

Xingyi Zhou, Qi-Xing Huang, Xiao Sun, xiangyang xue, Yichen Wei

2017-04-08ICCV 2017 103D Human Pose EstimationMonocular 3D Human Pose EstimationTransfer LearningPose EstimationPose Prediction3D Multi-Person Pose Estimation (root-relative)3D Multi-Person Pose Estimation (absolute)2D Pose Estimation
PaperPDFCodeCodeCodeCodeCodeCode(official)

Abstract

In this paper, we study the task of 3D human pose estimation in the wild. This task is challenging due to lack of training data, as existing datasets are either in the wild images with 2D pose or in the lab images with 3D pose. We propose a weakly-supervised transfer learning method that uses mixed 2D and 3D labels in a unified deep neutral network that presents two-stage cascaded structure. Our network augments a state-of-the-art 2D pose estimation sub-network with a 3D depth regression sub-network. Unlike previous two stage approaches that train the two sub-networks sequentially and separately, our training is end-to-end and fully exploits the correlation between the 2D pose and depth estimation sub-tasks. The deep features are better learnt through shared representations. In doing so, the 3D pose labels in controlled lab environments are transferred to in the wild images. In addition, we introduce a 3D geometric constraint to regularize the 3D pose prediction, which is effective in the absence of ground truth depth labels. Our method achieves competitive results on both 2D and 3D benchmarks.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationGeometric Pose Affordance MPJPE (CA)89.2Baseline model
3D Human Pose EstimationGeometric Pose Affordance MPJPE (CS)99.4Baseline model
3D Human Pose EstimationGeometric Pose Affordance PCK3D (CA)83.6Baseline model
3D Human Pose EstimationGeometric Pose Affordance PCK3D (CS)81.3Baseline model
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)64.9Weakly Supervised Transfer Learning
3D Human Pose EstimationHuman3.6MFrames Needed1Weakly Supervised Transfer Learning
Pose EstimationGeometric Pose Affordance MPJPE (CA)89.2Baseline model
Pose EstimationGeometric Pose Affordance MPJPE (CS)99.4Baseline model
Pose EstimationGeometric Pose Affordance PCK3D (CA)83.6Baseline model
Pose EstimationGeometric Pose Affordance PCK3D (CS)81.3Baseline model
Pose EstimationHuman3.6MAverage MPJPE (mm)64.9Weakly Supervised Transfer Learning
Pose EstimationHuman3.6MFrames Needed1Weakly Supervised Transfer Learning
3DGeometric Pose Affordance MPJPE (CA)89.2Baseline model
3DGeometric Pose Affordance MPJPE (CS)99.4Baseline model
3DGeometric Pose Affordance PCK3D (CA)83.6Baseline model
3DGeometric Pose Affordance PCK3D (CS)81.3Baseline model
3DHuman3.6MAverage MPJPE (mm)64.9Weakly Supervised Transfer Learning
3DHuman3.6MFrames Needed1Weakly Supervised Transfer Learning
1 Image, 2*2 StitchiGeometric Pose Affordance MPJPE (CA)89.2Baseline model
1 Image, 2*2 StitchiGeometric Pose Affordance MPJPE (CS)99.4Baseline model
1 Image, 2*2 StitchiGeometric Pose Affordance PCK3D (CA)83.6Baseline model
1 Image, 2*2 StitchiGeometric Pose Affordance PCK3D (CS)81.3Baseline model
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)64.9Weakly Supervised Transfer Learning
1 Image, 2*2 StitchiHuman3.6MFrames Needed1Weakly Supervised Transfer Learning

Related Papers

RaMen: Multi-Strategy Multi-Modal Learning for Bundle Construction2025-07-18Disentangling coincident cell events using deep transfer learning and compressive sensing2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17Best Practices for Large-Scale, Pixel-Wise Crop Mapping and Transfer Learning Workflows2025-07-16