TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Region Ensemble Network: Improving Convolutional Network f...

Region Ensemble Network: Improving Convolutional Network for Hand Pose Estimation

Hengkai Guo, Guijin Wang, Xinghao Chen, Cairong Zhang, Fei Qiao, Huazhong Yang

2017-02-08regressionPose EstimationHand Pose Estimation
PaperPDF

Abstract

Hand pose estimation from monocular depth images is an important and challenging problem for human-computer interaction. Recently deep convolutional networks (ConvNet) with sophisticated design have been employed to address it, but the improvement over traditional methods is not so apparent. To promote the performance of directly 3D coordinate regression, we propose a tree-structured Region Ensemble Network (REN), which partitions the convolution outputs into regions and integrates the results from multiple regressors on each regions. Compared with multi-model ensemble, our model is completely end-to-end training. The experimental results demonstrate that our approach achieves the best performance among state-of-the-arts on two public datasets.

Results

TaskDatasetMetricValueModel
HandMSRA HandsAverage 3D Error9.8REN
HandICVL HandsAverage 3D Error7.5REN
HandNYU HandsAverage 3D Error12.7REN
Pose EstimationMSRA HandsAverage 3D Error9.8REN
Pose EstimationICVL HandsAverage 3D Error7.5REN
Pose EstimationNYU HandsAverage 3D Error12.7REN
Hand Pose EstimationMSRA HandsAverage 3D Error9.8REN
Hand Pose EstimationICVL HandsAverage 3D Error7.5REN
Hand Pose EstimationNYU HandsAverage 3D Error12.7REN
3DMSRA HandsAverage 3D Error9.8REN
3DICVL HandsAverage 3D Error7.5REN
3DNYU HandsAverage 3D Error12.7REN
1 Image, 2*2 StitchiMSRA HandsAverage 3D Error9.8REN
1 Image, 2*2 StitchiICVL HandsAverage 3D Error7.5REN
1 Image, 2*2 StitchiNYU HandsAverage 3D Error12.7REN

Related Papers

Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression2025-07-20$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17Neural Network-Guided Symbolic Regression for Interpretable Descriptor Discovery in Perovskite Catalysts2025-07-16Imbalanced Regression Pipeline Recommendation2025-07-16