TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Dense 3D Regression for Hand Pose Estimation

Dense 3D Regression for Hand Pose Estimation

Chengde Wan, Thomas Probst, Luc van Gool, Angela Yao

2017-11-24CVPR 2018 63D Hand Pose EstimationregressionPose EstimationHand Pose Estimation
PaperPDFCode(official)

Abstract

We present a simple and effective method for 3D hand pose estimation from a single depth frame. As opposed to previous state-of-the-art methods based on holistic 3D regression, our method works on dense pixel-wise estimation. This is achieved by careful design choices in pose parameterization, which leverages both 2D and 3D properties of depth map. Specifically, we decompose the pose parameters into a set of per-pixel estimations, i.e., 2D heat maps, 3D heat maps and unit 3D directional vector fields. The 2D/3D joint heat maps and 3D joint offsets are estimated via multi-task network cascades, which is trained end-to-end. The pixel-wise estimations can be directly translated into a vote casting scheme. A variant of mean shift is then used to aggregate local votes while enforcing consensus between the the estimated 3D pose and the pixel-wise 2D and 3D estimations by design. Our method is efficient and highly accurate. On MSRA and NYU hand dataset, our method outperforms all previous state-of-the-art approaches by a large margin. On the ICVL hand dataset, our method achieves similar accuracy compared to the currently proposed nearly saturated result and outperforms various other proposed methods. Code is available $\href{"https://github.com/melonwan/denseReg"}{\text{online}}$.

Results

TaskDatasetMetricValueModel
HandMSRA HandsAverage 3D Error7.2Dense Pixel-wise Estimation
HandICVL HandsAverage 3D Error7.3Dense Pixel-wise Estimation
HandNYU HandsAverage 3D Error10.2Dense Pixel-wise Estimation
Pose EstimationMSRA HandsAverage 3D Error7.2Dense Pixel-wise Estimation
Pose EstimationICVL HandsAverage 3D Error7.3Dense Pixel-wise Estimation
Pose EstimationNYU HandsAverage 3D Error10.2Dense Pixel-wise Estimation
Hand Pose EstimationMSRA HandsAverage 3D Error7.2Dense Pixel-wise Estimation
Hand Pose EstimationICVL HandsAverage 3D Error7.3Dense Pixel-wise Estimation
Hand Pose EstimationNYU HandsAverage 3D Error10.2Dense Pixel-wise Estimation
3DMSRA HandsAverage 3D Error7.2Dense Pixel-wise Estimation
3DICVL HandsAverage 3D Error7.3Dense Pixel-wise Estimation
3DNYU HandsAverage 3D Error10.2Dense Pixel-wise Estimation
1 Image, 2*2 StitchiMSRA HandsAverage 3D Error7.2Dense Pixel-wise Estimation
1 Image, 2*2 StitchiICVL HandsAverage 3D Error7.3Dense Pixel-wise Estimation
1 Image, 2*2 StitchiNYU HandsAverage 3D Error10.2Dense Pixel-wise Estimation

Related Papers

Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression2025-07-20$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17Neural Network-Guided Symbolic Regression for Interpretable Descriptor Discovery in Perovskite Catalysts2025-07-16Imbalanced Regression Pipeline Recommendation2025-07-16