TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Hand Image Understanding via Deep Multi-Task Learning

Hand Image Understanding via Deep Multi-Task Learning

Xiong Zhang, Hongsheng Huang, Jianchao Tan, Hongmin Xu, Cheng Yang, Guozhu Peng, Lei Wang, Ji Liu

2021-07-24ICCV 2021 103D Hand Pose EstimationSelf-Supervised LearningPose EstimationMulti-Task LearningHand Pose Estimation
PaperPDFCode(official)

Abstract

Analyzing and understanding hand information from multimedia materials like images or videos is important for many real world applications and remains active in research community. There are various works focusing on recovering hand information from single image, however, they usually solve a single task, for example, hand mask segmentation, 2D/3D hand pose estimation, or hand mesh reconstruction and perform not well in challenging scenarios. To further improve the performance of these tasks, we propose a novel Hand Image Understanding (HIU) framework to extract comprehensive information of the hand object from a single RGB image, by jointly considering the relationships between these tasks. To achieve this goal, a cascaded multi-task learning (MTL) backbone is designed to estimate the 2D heat maps, to learn the segmentation mask, and to generate the intermediate 3D information encoding, followed by a coarse-to-fine learning paradigm and a self-supervised learning strategy. Qualitative experiments demonstrate that our approach is capable of recovering reasonable mesh representations even in challenging situations. Quantitatively, our method significantly outperforms the state-of-the-art approaches on various widely-used datasets, in terms of diverse evaluation metrics.

Results

TaskDatasetMetricValueModel
HandFreiHANDPA-F@15mm0.974HIU-DMTL
HandFreiHANDPA-F@5mm0.699HIU-DMTL
HandFreiHANDPA-MPJPE7.1HIU-DMTL
HandFreiHANDPA-MPVPE7.3HIU-DMTL
Pose EstimationFreiHANDPA-F@15mm0.974HIU-DMTL
Pose EstimationFreiHANDPA-F@5mm0.699HIU-DMTL
Pose EstimationFreiHANDPA-MPJPE7.1HIU-DMTL
Pose EstimationFreiHANDPA-MPVPE7.3HIU-DMTL
Hand Pose EstimationFreiHANDPA-F@15mm0.974HIU-DMTL
Hand Pose EstimationFreiHANDPA-F@5mm0.699HIU-DMTL
Hand Pose EstimationFreiHANDPA-MPJPE7.1HIU-DMTL
Hand Pose EstimationFreiHANDPA-MPVPE7.3HIU-DMTL
3DFreiHANDPA-F@15mm0.974HIU-DMTL
3DFreiHANDPA-F@5mm0.699HIU-DMTL
3DFreiHANDPA-MPJPE7.1HIU-DMTL
3DFreiHANDPA-MPVPE7.3HIU-DMTL
3D Hand Pose EstimationFreiHANDPA-F@15mm0.974HIU-DMTL
3D Hand Pose EstimationFreiHANDPA-F@5mm0.699HIU-DMTL
3D Hand Pose EstimationFreiHANDPA-MPJPE7.1HIU-DMTL
3D Hand Pose EstimationFreiHANDPA-MPVPE7.3HIU-DMTL
1 Image, 2*2 StitchiFreiHANDPA-F@15mm0.974HIU-DMTL
1 Image, 2*2 StitchiFreiHANDPA-F@5mm0.699HIU-DMTL
1 Image, 2*2 StitchiFreiHANDPA-MPJPE7.1HIU-DMTL
1 Image, 2*2 StitchiFreiHANDPA-MPVPE7.3HIU-DMTL

Related Papers

A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16