Markus Oberweger, Vincent Lepetit
DeepPrior is a simple approach based on Deep Learning that predicts the 3D joint locations of a hand given a depth map. Since its publication in early 2015, it has been outperformed by several impressive works. Here we show that with simple improvements — adding ResNet layers, data augmentation, and better initial hand localization — we achieve performance better than or comparable to more sophisticated recent methods on the three main benchmarks (NYU, ICVL, MSRA) while keeping the simplicity of the original method. Our new implementation is available at https://github.com/moberweger/deep-prior-pp .
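The abstract credits part of the improvement to data augmentation on the hand depth crops. As a rough illustration only — this is a hypothetical NumPy sketch, not the authors' implementation, and the ranges for rotation, translation, and scale are assumptions — in-plane augmentation of a depth crop and its 2D joint annotations could look like:

```python
import numpy as np

def augment_depth_crop(crop, joints_2d, rng):
    """Apply a random in-plane rotation, translation, and scale to a
    hand depth crop and its 2D joint annotations.

    Hypothetical sketch of depth-map augmentation; parameter ranges
    (rotation, +/-5 px shift, 0.9-1.1 scale) are illustrative
    assumptions, not values from the paper.
    crop:      (H, W) depth image centered on the hand.
    joints_2d: (J, 2) joint positions in pixel coordinates.
    """
    h, w = crop.shape
    center = np.array([w / 2.0, h / 2.0])

    angle = rng.uniform(-np.pi, np.pi)          # in-plane rotation
    shift = rng.uniform(-5.0, 5.0, size=2)      # pixel translation
    scale = rng.uniform(0.9, 1.1)               # isotropic scale

    c, s = np.cos(angle), np.sin(angle)
    rot = np.array([[c, -s], [s, c]])

    # Transform joint coordinates around the crop center.
    new_joints = (joints_2d - center) @ rot.T * scale + center + shift

    # Resample the depth image with the inverse map
    # (nearest-neighbor, so depth values stay valid).
    ys, xs = np.mgrid[0:h, 0:w]
    dst = np.stack([xs.ravel(), ys.ravel()], axis=1).astype(np.float64)
    src = ((dst - center - shift) / scale) @ rot + center
    sx = np.clip(np.round(src[:, 0]).astype(int), 0, w - 1)
    sy = np.clip(np.round(src[:, 1]).astype(int), 0, h - 1)
    new_crop = crop[sy, sx].reshape(h, w)
    return new_crop, new_joints
```

Transforming joints and pixels with the same parameters keeps the annotation consistent with the augmented image, which is the property any such augmentation scheme must preserve.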
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Hand Pose Estimation | NYU Hands | Average 3D Error (mm) | 12.3 | DeepPrior++ |
| Hand Pose Estimation | ICVL Hands | Average 3D Error (mm) | 8.1 | DeepPrior++ |
| Hand Pose Estimation | MSRA Hands | Average 3D Error (mm) | 9.5 | DeepPrior++ |