TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Hamba: Single-view 3D Hand Reconstruction with Graph-guide...

Hamba: Single-view 3D Hand Reconstruction with Graph-guided Bi-Scanning Mamba

Haoye Dong, Aviral Chharia, Wenbo Gou, Francisco Vicente Carrasco, Fernando de la Torre

2024-07-123D Hand Pose Estimation
PaperPDFCode(official)

Abstract

3D Hand reconstruction from a single RGB image is challenging due to the articulated motion, self-occlusion, and interaction with objects. Existing SOTA methods employ attention-based transformers to learn the 3D hand pose and shape, yet they do not fully achieve robust and accurate performance, primarily due to inefficiently modeling spatial relations between joints. To address this problem, we propose a novel graph-guided Mamba framework, named Hamba, which bridges graph learning and state space modeling. Our core idea is to reformulate Mamba's scanning into graph-guided bidirectional scanning for 3D reconstruction using a few effective tokens. This enables us to efficiently learn the spatial relationships between joints for improving reconstruction performance. Specifically, we design a Graph-guided State Space (GSS) block that learns the graph-structured relations and spatial sequences of joints and uses 88.5% fewer tokens than attention-based methods. Additionally, we integrate the state space features and the global features using a fusion module. By utilizing the GSS block and the fusion module, Hamba effectively leverages the graph-guided state space features and jointly considers global and local features to improve performance. Experiments on several benchmarks and in-the-wild tests demonstrate that Hamba significantly outperforms existing SOTAs, achieving the PA-MPVPE of 5.3mm and F@15mm of 0.992 on FreiHAND. At the time of this paper's acceptance, Hamba holds the top position, Rank 1 in two Competition Leaderboards on 3D hand reconstruction. Project Website: https://humansensinglab.github.io/Hamba/

Results

TaskDatasetMetricValueModel
HandHO-3D v3AUC_J0.861Hamba
HandHO-3D v3AUC_V0.864Hamba
HandHO-3D v3F@15mm0.982Hamba
HandHO-3D v3F@5mm0.681Hamba
HandHO-3D v3PA-MPJPE6.9Hamba
HandHO-3D v3PA-MPVPE6.8Hamba
HandFreiHANDPA-F@15mm0.992Hamba
HandFreiHANDPA-F@5mm0.806Hamba
HandFreiHANDPA-MPJPE5.7Hamba
HandFreiHANDPA-MPVPE5.3Hamba
HandHInt: Hand Interactions in the wildPCK@0.05 (New Days) All48.7Hamba
HandHInt: Hand Interactions in the wildPCK@0.05 (NewDays) Occ28.2Hamba
HandHInt: Hand Interactions in the wildPCK@0.05 (NewDays) Visible61.2Hamba
HandHInt: Hand Interactions in the wildPCK@0.05 (VISOR) All47.2Hamba
HandHInt: Hand Interactions in the wildPCK@0.05 (VISOR) Occ29.9Hamba
HandHInt: Hand Interactions in the wildPCK@0.05 (VISOR) Visible61.4Hamba
HandHO-3D v2AUC_J0.85Hamba
HandHO-3D v2AUC_V0.846Hamba
HandHO-3D v2F@15mm0.982Hamba
HandHO-3D v2F@5mm0.648Hamba
HandHO-3D v2PA-MPJPE (mm)7.5Hamba
HandHO-3D v2PA-MPVPE7.7Hamba
Pose EstimationHO-3D v3AUC_J0.861Hamba
Pose EstimationHO-3D v3AUC_V0.864Hamba
Pose EstimationHO-3D v3F@15mm0.982Hamba
Pose EstimationHO-3D v3F@5mm0.681Hamba
Pose EstimationHO-3D v3PA-MPJPE6.9Hamba
Pose EstimationHO-3D v3PA-MPVPE6.8Hamba
Pose EstimationFreiHANDPA-F@15mm0.992Hamba
Pose EstimationFreiHANDPA-F@5mm0.806Hamba
Pose EstimationFreiHANDPA-MPJPE5.7Hamba
Pose EstimationFreiHANDPA-MPVPE5.3Hamba
Pose EstimationHInt: Hand Interactions in the wildPCK@0.05 (New Days) All48.7Hamba
Pose EstimationHInt: Hand Interactions in the wildPCK@0.05 (NewDays) Occ28.2Hamba
Pose EstimationHInt: Hand Interactions in the wildPCK@0.05 (NewDays) Visible61.2Hamba
Pose EstimationHInt: Hand Interactions in the wildPCK@0.05 (VISOR) All47.2Hamba
Pose EstimationHInt: Hand Interactions in the wildPCK@0.05 (VISOR) Occ29.9Hamba
Pose EstimationHInt: Hand Interactions in the wildPCK@0.05 (VISOR) Visible61.4Hamba
Pose EstimationHO-3D v2AUC_J0.85Hamba
Pose EstimationHO-3D v2AUC_V0.846Hamba
Pose EstimationHO-3D v2F@15mm0.982Hamba
Pose EstimationHO-3D v2F@5mm0.648Hamba
Pose EstimationHO-3D v2PA-MPJPE (mm)7.5Hamba
Pose EstimationHO-3D v2PA-MPVPE7.7Hamba
Hand Pose EstimationHO-3D v3AUC_J0.861Hamba
Hand Pose EstimationHO-3D v3AUC_V0.864Hamba
Hand Pose EstimationHO-3D v3F@15mm0.982Hamba
Hand Pose EstimationHO-3D v3F@5mm0.681Hamba
Hand Pose EstimationHO-3D v3PA-MPJPE6.9Hamba
Hand Pose EstimationHO-3D v3PA-MPVPE6.8Hamba
Hand Pose EstimationFreiHANDPA-F@15mm0.992Hamba
Hand Pose EstimationFreiHANDPA-F@5mm0.806Hamba
Hand Pose EstimationFreiHANDPA-MPJPE5.7Hamba
Hand Pose EstimationFreiHANDPA-MPVPE5.3Hamba
Hand Pose EstimationHInt: Hand Interactions in the wildPCK@0.05 (New Days) All48.7Hamba
Hand Pose EstimationHInt: Hand Interactions in the wildPCK@0.05 (NewDays) Occ28.2Hamba
Hand Pose EstimationHInt: Hand Interactions in the wildPCK@0.05 (NewDays) Visible61.2Hamba
Hand Pose EstimationHInt: Hand Interactions in the wildPCK@0.05 (VISOR) All47.2Hamba
Hand Pose EstimationHInt: Hand Interactions in the wildPCK@0.05 (VISOR) Occ29.9Hamba
Hand Pose EstimationHInt: Hand Interactions in the wildPCK@0.05 (VISOR) Visible61.4Hamba
Hand Pose EstimationHO-3D v2AUC_J0.85Hamba
Hand Pose EstimationHO-3D v2AUC_V0.846Hamba
Hand Pose EstimationHO-3D v2F@15mm0.982Hamba
Hand Pose EstimationHO-3D v2F@5mm0.648Hamba
Hand Pose EstimationHO-3D v2PA-MPJPE (mm)7.5Hamba
Hand Pose EstimationHO-3D v2PA-MPVPE7.7Hamba
3DHO-3D v3AUC_J0.861Hamba
3DHO-3D v3AUC_V0.864Hamba
3DHO-3D v3F@15mm0.982Hamba
3DHO-3D v3F@5mm0.681Hamba
3DHO-3D v3PA-MPJPE6.9Hamba
3DHO-3D v3PA-MPVPE6.8Hamba
3DFreiHANDPA-F@15mm0.992Hamba
3DFreiHANDPA-F@5mm0.806Hamba
3DFreiHANDPA-MPJPE5.7Hamba
3DFreiHANDPA-MPVPE5.3Hamba
3DHInt: Hand Interactions in the wildPCK@0.05 (New Days) All48.7Hamba
3DHInt: Hand Interactions in the wildPCK@0.05 (NewDays) Occ28.2Hamba
3DHInt: Hand Interactions in the wildPCK@0.05 (NewDays) Visible61.2Hamba
3DHInt: Hand Interactions in the wildPCK@0.05 (VISOR) All47.2Hamba
3DHInt: Hand Interactions in the wildPCK@0.05 (VISOR) Occ29.9Hamba
3DHInt: Hand Interactions in the wildPCK@0.05 (VISOR) Visible61.4Hamba
3DHO-3D v2AUC_J0.85Hamba
3DHO-3D v2AUC_V0.846Hamba
3DHO-3D v2F@15mm0.982Hamba
3DHO-3D v2F@5mm0.648Hamba
3DHO-3D v2PA-MPJPE (mm)7.5Hamba
3DHO-3D v2PA-MPVPE7.7Hamba
3D Hand Pose EstimationHO-3D v3AUC_J0.861Hamba
3D Hand Pose EstimationHO-3D v3AUC_V0.864Hamba
3D Hand Pose EstimationHO-3D v3F@15mm0.982Hamba
3D Hand Pose EstimationHO-3D v3F@5mm0.681Hamba
3D Hand Pose EstimationHO-3D v3PA-MPJPE6.9Hamba
3D Hand Pose EstimationHO-3D v3PA-MPVPE6.8Hamba
3D Hand Pose EstimationFreiHANDPA-F@15mm0.992Hamba
3D Hand Pose EstimationFreiHANDPA-F@5mm0.806Hamba
3D Hand Pose EstimationFreiHANDPA-MPJPE5.7Hamba
3D Hand Pose EstimationFreiHANDPA-MPVPE5.3Hamba
3D Hand Pose EstimationHInt: Hand Interactions in the wildPCK@0.05 (New Days) All48.7Hamba
3D Hand Pose EstimationHInt: Hand Interactions in the wildPCK@0.05 (NewDays) Occ28.2Hamba
3D Hand Pose EstimationHInt: Hand Interactions in the wildPCK@0.05 (NewDays) Visible61.2Hamba
3D Hand Pose EstimationHInt: Hand Interactions in the wildPCK@0.05 (VISOR) All47.2Hamba
3D Hand Pose EstimationHInt: Hand Interactions in the wildPCK@0.05 (VISOR) Occ29.9Hamba
3D Hand Pose EstimationHInt: Hand Interactions in the wildPCK@0.05 (VISOR) Visible61.4Hamba
3D Hand Pose EstimationHO-3D v2AUC_J0.85Hamba
3D Hand Pose EstimationHO-3D v2AUC_V0.846Hamba
3D Hand Pose EstimationHO-3D v2F@15mm0.982Hamba
3D Hand Pose EstimationHO-3D v2F@5mm0.648Hamba
3D Hand Pose EstimationHO-3D v2PA-MPJPE (mm)7.5Hamba
3D Hand Pose EstimationHO-3D v2PA-MPVPE7.7Hamba
1 Image, 2*2 StitchiHO-3D v3AUC_J0.861Hamba
1 Image, 2*2 StitchiHO-3D v3AUC_V0.864Hamba
1 Image, 2*2 StitchiHO-3D v3F@15mm0.982Hamba
1 Image, 2*2 StitchiHO-3D v3F@5mm0.681Hamba
1 Image, 2*2 StitchiHO-3D v3PA-MPJPE6.9Hamba
1 Image, 2*2 StitchiHO-3D v3PA-MPVPE6.8Hamba
1 Image, 2*2 StitchiFreiHANDPA-F@15mm0.992Hamba
1 Image, 2*2 StitchiFreiHANDPA-F@5mm0.806Hamba
1 Image, 2*2 StitchiFreiHANDPA-MPJPE5.7Hamba
1 Image, 2*2 StitchiFreiHANDPA-MPVPE5.3Hamba
1 Image, 2*2 StitchiHInt: Hand Interactions in the wildPCK@0.05 (New Days) All48.7Hamba
1 Image, 2*2 StitchiHInt: Hand Interactions in the wildPCK@0.05 (NewDays) Occ28.2Hamba
1 Image, 2*2 StitchiHInt: Hand Interactions in the wildPCK@0.05 (NewDays) Visible61.2Hamba
1 Image, 2*2 StitchiHInt: Hand Interactions in the wildPCK@0.05 (VISOR) All47.2Hamba
1 Image, 2*2 StitchiHInt: Hand Interactions in the wildPCK@0.05 (VISOR) Occ29.9Hamba
1 Image, 2*2 StitchiHInt: Hand Interactions in the wildPCK@0.05 (VISOR) Visible61.4Hamba
1 Image, 2*2 StitchiHO-3D v2AUC_J0.85Hamba
1 Image, 2*2 StitchiHO-3D v2AUC_V0.846Hamba
1 Image, 2*2 StitchiHO-3D v2F@15mm0.982Hamba
1 Image, 2*2 StitchiHO-3D v2F@5mm0.648Hamba
1 Image, 2*2 StitchiHO-3D v2PA-MPJPE (mm)7.5Hamba
1 Image, 2*2 StitchiHO-3D v2PA-MPVPE7.7Hamba

Related Papers

ExtPose: Robust and Coherent Pose Estimation by Extending ViTs2025-06-18Monocular 3D Hand Pose Estimation with Implicit Camera Alignment2025-06-10OccRobNet : Occlusion Robust Network for Accurate 3D Interacting Hand-Object Pose Estimation2025-03-27Analyzing the Synthetic-to-Real Domain Gap in 3D Hand Pose Estimation2025-03-25SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training2025-02-21HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos2025-01-06BIGS: Bimanual Category-agnostic Interaction Reconstruction from Monocular Videos via 3D Gaussian Splatting2025-01-01MMHMR: Generative Masked Modeling for Hand Mesh Recovery2024-12-18