Towards Viewpoint Invariant 3D Human Pose Estimation

Albert Haque, Boya Peng, Zelun Luo, Alexandre Alahi, Serena Yeung, Li Fei-Fei

2016-03-23
3D Human Pose Estimation, Pose Estimation, Multi-Task Learning

Abstract

We propose a viewpoint invariant model for 3D human pose estimation from a single depth image. To achieve this, our discriminative model embeds local regions into a learned viewpoint invariant feature space. Formulated as a multi-task learning problem, our model is able to selectively predict partial poses in the presence of noise and occlusion. Our approach leverages a convolutional and recurrent network architecture with a top-down error feedback mechanism to self-correct previous pose estimates in an end-to-end manner. We evaluate our model on a previously published depth dataset and a newly collected human pose dataset containing 100K annotated depth images from extreme viewpoints. Experiments show that our model achieves competitive performance on frontal views while achieving state-of-the-art performance on alternate viewpoints.

Results

Task	Dataset	Metric	Value	Model
Pose Estimation	ITOP top-view	Mean mAP	75.5	Multi-task learning + viewpoint invariance
Pose Estimation	ITOP front-view	Mean mAP	77.4	Multi-task learning + viewpoint invariance
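
The Mean mAP figures above are reported without a definition on this page. As a rough sketch only, assuming the metric counts a joint as detected when its predicted 3D position lies within a fixed Euclidean distance of the ground truth, a detection rate could be computed as shown here. The 10 cm threshold, the function name `joint_detection_rate`, the `threshold_m` parameter, and the sample coordinates are all illustrative assumptions, not values taken from the text.

```python
import math


def joint_detection_rate(pred, truth, threshold_m=0.10):
    """Fraction of joints predicted within threshold_m of the ground truth.

    pred, truth: lists of frames; each frame is a list of (x, y, z) tuples
    in metres. The 0.10 m default is an assumption, not a value stated in
    the summary above.
    """
    hits = 0
    total = 0
    for frame_pred, frame_truth in zip(pred, truth):
        for p, t in zip(frame_pred, frame_truth):
            total += 1
            # A joint counts as detected if the Euclidean error is small enough.
            if math.dist(p, t) <= threshold_m:
                hits += 1
    if total == 0:
        raise ValueError("no joints to evaluate")
    return hits / total


# Two frames with two joints each; coordinates are made up for illustration.
truth = [[(0.0, 0.0, 2.0), (0.0, 0.5, 2.0)],
         [(0.1, 0.0, 2.0), (0.1, 0.5, 2.0)]]
pred = [[(0.0, 0.05, 2.0), (0.0, 0.5, 2.3)],
        [(0.1, 0.0, 2.0), (0.3, 0.5, 2.0)]]
rate = joint_detection_rate(pred, truth)
```

With this made-up data, two of the four joints fall within the threshold, so `rate` is 0.5. Averaging such rates per joint type and then across joints would give a single summary number comparable in spirit to the table's Mean mAP column, under the same assumptions.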
