TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/HandAugment: A Simple Data Augmentation Method for Depth-B...

HandAugment: A Simple Data Augmentation Method for Depth-Based 3D Hand Pose Estimation

Zhaohui Zhang, Shipeng Xie, Mingxiu Chen, Haichao Zhu

2020-01-033D Hand Pose EstimationData AugmentationPose EstimationHand Pose Estimation
PaperPDFCode

Abstract

Hand pose estimation from 3D depth images, has been explored widely using various kinds of techniques in the field of computer vision. Though, deep learning based method improve the performance greatly recently, however, this problem still remains unsolved due to lack of large datasets, like ImageNet or effective data synthesis methods. In this paper, we propose HandAugment, a method to synthesize image data to augment the training process of the neural networks. Our method has two main parts: First, We propose a scheme of two-stage neural networks. This scheme can make the neural networks focus on the hand regions and thus to improve the performance. Second, we introduce a simple and effective method to synthesize data by combining real and synthetic image together in the image space. Finally, we show that our method achieves the first place in the task of depth-based 3D hand pose estimation in HANDS 2019 challenge.

Results

TaskDatasetMetricValueModel
HandHANDS 2019Average 3D Error13.66HandAugment
Pose EstimationHANDS 2019Average 3D Error13.66HandAugment
Hand Pose EstimationHANDS 2019Average 3D Error13.66HandAugment
3DHANDS 2019Average 3D Error13.66HandAugment
1 Image, 2*2 StitchiHANDS 2019Average 3D Error13.66HandAugment

Related Papers

Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16