Ziwei Yu, Linlin Yang, You Xie, Ping Chen, Angela Yao
We propose a novel framework for 3D hand shape reconstruction and hand-object grasp optimization from a single RGB image. The representation of hand-object contact regions is critical for accurate reconstructions. Instead of approximating the contact regions with sparse points, as in previous works, we propose a dense representation in the form of a UV coordinate map. Furthermore, we introduce inference-time optimization to fine-tune the grasp and improve interactions between the hand and the object. Our pipeline increases hand shape reconstruction accuracy and produces a vibrant hand texture. Experiments on datasets such as Ho3D, FreiHAND, and DexYCB reveal that our proposed method outperforms the state-of-the-art.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Hand | HO-3D v3 | PA-MPJPE | 10.8 | Yu et al. |
| Hand | HO-3D v3 | PA-MPVPE | 10.4 | Yu et al. |
| Pose Estimation | HO-3D v3 | PA-MPJPE | 10.8 | Yu et al. |
| Pose Estimation | HO-3D v3 | PA-MPVPE | 10.4 | Yu et al. |
| Hand Pose Estimation | HO-3D v3 | PA-MPJPE | 10.8 | Yu et al. |
| Hand Pose Estimation | HO-3D v3 | PA-MPVPE | 10.4 | Yu et al. |
| 3D | HO-3D v3 | PA-MPJPE | 10.8 | Yu et al. |
| 3D | HO-3D v3 | PA-MPVPE | 10.4 | Yu et al. |
| 3D Hand Pose Estimation | HO-3D v3 | PA-MPJPE | 10.8 | Yu et al. |
| 3D Hand Pose Estimation | HO-3D v3 | PA-MPVPE | 10.4 | Yu et al. |
| 1 Image, 2*2 Stitchi | HO-3D v3 | PA-MPJPE | 10.8 | Yu et al. |
| 1 Image, 2*2 Stitchi | HO-3D v3 | PA-MPVPE | 10.4 | Yu et al. |