TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Geometric Pose Affordance: 3D Human Pose with Scene Constr...

Geometric Pose Affordance: 3D Human Pose with Scene Constraints

Zhe Wang, Liyan Chen, Shaurya Rathore, Daeyun Shin, Charless Fowlkes

2019-05-193D Human Pose EstimationPose Estimation
PaperPDF

Abstract

Full 3D estimation of human pose from a single image remains a challenging task despite many recent advances. In this paper, we explore the hypothesis that strong prior information about scene geometry can be used to improve pose estimation accuracy. To tackle this question empirically, we have assembled a novel $\textbf{Geometric Pose Affordance}$ dataset, consisting of multi-view imagery of people interacting with a variety of rich 3D environments. We utilized a commercial motion capture system to collect gold-standard estimates of pose and construct accurate geometric 3D CAD models of the scene itself. To inject prior knowledge of scene constraints into existing frameworks for pose estimation from images, we introduce a novel, view-based representation of scene geometry, a $\textbf{multi-layer depth map}$, which employs multi-hit ray tracing to concisely encode multiple surface entry and exit points along each camera view ray direction. We propose two different mechanisms for integrating multi-layer depth information pose estimation: input as encoded ray features used in lifting 2D pose to full 3D, and secondly as a differentiable loss that encourages learned models to favor geometrically consistent pose estimates. We show experimentally that these techniques can improve the accuracy of 3D pose estimates, particularly in the presence of occlusion and complex scene geometry.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationGeometric Pose AffordanceMPJPE94.1ResNet-F
3D Human Pose EstimationGeometric Pose AffordanceMPJPE (CA)85.6ResNet-F
3D Human Pose EstimationGeometric Pose AffordanceMPJPE (CS)97.8ResNet-F
3D Human Pose EstimationGeometric Pose AffordancePCK82.9ResNet-F
3D Human Pose EstimationGeometric Pose AffordancePCK3D (CA)84.8ResNet-F
3D Human Pose EstimationGeometric Pose AffordancePCK3D (CS)82ResNet-F
Pose EstimationGeometric Pose AffordanceMPJPE94.1ResNet-F
Pose EstimationGeometric Pose AffordanceMPJPE (CA)85.6ResNet-F
Pose EstimationGeometric Pose AffordanceMPJPE (CS)97.8ResNet-F
Pose EstimationGeometric Pose AffordancePCK82.9ResNet-F
Pose EstimationGeometric Pose AffordancePCK3D (CA)84.8ResNet-F
Pose EstimationGeometric Pose AffordancePCK3D (CS)82ResNet-F
3DGeometric Pose AffordanceMPJPE94.1ResNet-F
3DGeometric Pose AffordanceMPJPE (CA)85.6ResNet-F
3DGeometric Pose AffordanceMPJPE (CS)97.8ResNet-F
3DGeometric Pose AffordancePCK82.9ResNet-F
3DGeometric Pose AffordancePCK3D (CA)84.8ResNet-F
3DGeometric Pose AffordancePCK3D (CS)82ResNet-F
1 Image, 2*2 StitchiGeometric Pose AffordanceMPJPE94.1ResNet-F
1 Image, 2*2 StitchiGeometric Pose AffordanceMPJPE (CA)85.6ResNet-F
1 Image, 2*2 StitchiGeometric Pose AffordanceMPJPE (CS)97.8ResNet-F
1 Image, 2*2 StitchiGeometric Pose AffordancePCK82.9ResNet-F
1 Image, 2*2 StitchiGeometric Pose AffordancePCK3D (CA)84.8ResNet-F
1 Image, 2*2 StitchiGeometric Pose AffordancePCK3D (CS)82ResNet-F

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16