
Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Towards Viewpoint Invariant 3D Human Pose Estimation

Albert Haque, Boya Peng, Zelun Luo, Alexandre Alahi, Serena Yeung, Li Fei-Fei

2016-03-23 · 3D Human Pose Estimation · Pose Estimation · Multi-Task Learning

Abstract

We propose a viewpoint invariant model for 3D human pose estimation from a single depth image. To achieve this, our discriminative model embeds local regions into a learned viewpoint invariant feature space. Formulated as a multi-task learning problem, our model is able to selectively predict partial poses in the presence of noise and occlusion. Our approach leverages a convolutional and recurrent network architecture with a top-down error feedback mechanism to self-correct previous pose estimates in an end-to-end manner. We evaluate our model on a previously published depth dataset and a newly collected human pose dataset containing 100K annotated depth images from extreme viewpoints. Experiments show that our model achieves competitive performance on frontal views while achieving state-of-the-art performance on alternate viewpoints.
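The "top-down error feedback" idea mentioned in the abstract can be illustrated with a toy refinement loop: start from an initial pose and, at each step, add a predicted correction to the previous estimate. This is only a minimal sketch under assumptions. The names `refine_pose`, `predict_offset`, and `toy_predictor` and the step count are hypothetical, and the stand-in predictor below takes the place of the paper's convolutional and recurrent network, which is not reproduced here.

```python
from typing import Callable, List, Tuple

Joint = Tuple[float, ...]   # (x, y, z) coordinates of one joint
Pose = List[Joint]          # one entry per body joint


def refine_pose(initial: Pose,
                predict_offset: Callable[[Pose], Pose],
                steps: int = 3) -> Pose:
    """Iteratively correct a pose estimate (hypothetical helper).

    At every step the predictor looks at the current estimate and returns
    a per-joint correction, which is added to the estimate.
    """
    pose = list(initial)
    for _ in range(steps):
        offsets = predict_offset(pose)
        pose = [
            tuple(p + d for p, d in zip(joint, offset))
            for joint, offset in zip(pose, offsets)
        ]
    return pose


# Stand-in predictor (an assumption for illustration only): it moves each
# joint halfway towards a known target. A trained network would instead
# infer the correction from image features and the previous estimate.
target: Pose = [(1.0, 2.0, 3.0)]


def toy_predictor(pose: Pose) -> Pose:
    return [
        tuple(0.5 * (t - p) for t, p in zip(target_joint, joint))
        for target_joint, joint in zip(target, pose)
    ]


refined = refine_pose([(0.0, 0.0, 0.0)], toy_predictor, steps=3)
```

With this toy predictor the remaining error halves at every step, so the estimate approaches the target without ever being replaced outright, which is the self-correcting behaviour the abstract describes in general terms.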

Results

Task | Dataset | Metric | Value | Model
Pose Estimation | ITOP top-view | Mean mAP | 75.5 | Multi-task learning + viewpoint invariance
Pose Estimation | ITOP front-view | Mean mAP | 77.4 | Multi-task learning + viewpoint invariance
3D | ITOP top-view | Mean mAP | 75.5 | Multi-task learning + viewpoint invariance
3D | ITOP front-view | Mean mAP | 77.4 | Multi-task learning + viewpoint invariance
1 Image, 2*2 Stitchi | ITOP top-view | Mean mAP | 75.5 | Multi-task learning + viewpoint invariance
1 Image, 2*2 Stitchi | ITOP front-view | Mean mAP | 77.4 | Multi-task learning + viewpoint invariance
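This page does not define how "Mean mAP" is computed. A common convention for depth-based pose benchmarks is to count a predicted joint as correct when it lies within a fixed distance of the ground truth, then average the per-joint rates. The sketch below follows that convention as an assumption: the function name `joint_precision` and the 0.10 m threshold are illustrative choices, not values taken from this page, and the sample coordinates are made up.

```python
import math


def joint_precision(predictions, ground_truth, threshold_m=0.10):
    """Return (per-joint precision, mean over joints).

    predictions / ground_truth: one pose per frame, each pose being a list
    of (x, y, z) joint positions in metres.
    threshold_m: assumed correctness radius (hypothetical default).
    """
    num_frames = len(ground_truth)
    num_joints = len(ground_truth[0])
    hits = [0] * num_joints
    for pred_pose, gt_pose in zip(predictions, ground_truth):
        for j, (p, g) in enumerate(zip(pred_pose, gt_pose)):
            # A joint counts as detected if it is within the threshold.
            if math.dist(p, g) <= threshold_m:
                hits[j] += 1
    per_joint = [h / num_frames for h in hits]
    return per_joint, sum(per_joint) / num_joints


# Two frames, two joints (made-up coordinates for illustration).
gt = [
    [(0.0, 0.0, 1.0), (0.0, 0.5, 1.0)],
    [(0.0, 0.0, 1.0), (0.0, 0.5, 1.0)],
]
pred = [
    [(0.02, 0.0, 1.0), (0.0, 0.5, 1.3)],   # second joint is 0.3 m off
    [(0.0, 0.05, 1.0), (0.0, 0.55, 1.0)],
]
per_joint, mean_precision = joint_precision(pred, gt)
```

In this example the first joint is within the threshold in both frames and the second in only one, so the per-joint rates are 1.0 and 0.5 and their mean is 0.75. Scores such as those in the table above would be the analogous mean expressed as a percentage, if the same convention applies.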

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning (2025-07-17)
Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark (2025-07-17)
DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model (2025-07-17)
From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation (2025-07-17)
AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability (2025-07-17)
SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation (2025-07-17)
SpatialTrackerV2: 3D Point Tracking Made Easy (2025-07-16)
SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation (2025-07-16)