Dominik Kulon, Riza Alp Güler, Iasonas Kokkinos, Michael Bronstein, Stefanos Zafeiriou
We introduce a simple and effective network architecture for monocular 3D hand pose estimation consisting of an image encoder followed by a mesh convolutional decoder that is trained through a direct 3D hand mesh reconstruction loss. We train our network by gathering a large-scale dataset of hand action in YouTube videos and use it as a source of weak supervision. Our weakly-supervised mesh convolutions-based system largely outperforms state-of-the-art methods, even halving the errors on the in the wild benchmark. The dataset and additional resources are available at https://arielai.com/mesh_hands.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Hand | FreiHAND | PA-F@15mm | 0.966 | YoutubeHand |
| Hand | FreiHAND | PA-F@5mm | 0.614 | YoutubeHand |
| Hand | FreiHAND | PA-MPJPE | 8.4 | YoutubeHand |
| Hand | FreiHAND | PA-MPVPE | 8.6 | YoutubeHand |
| Pose Estimation | FreiHAND | PA-F@15mm | 0.966 | YoutubeHand |
| Pose Estimation | FreiHAND | PA-F@5mm | 0.614 | YoutubeHand |
| Pose Estimation | FreiHAND | PA-MPJPE | 8.4 | YoutubeHand |
| Pose Estimation | FreiHAND | PA-MPVPE | 8.6 | YoutubeHand |
| Hand Pose Estimation | FreiHAND | PA-F@15mm | 0.966 | YoutubeHand |
| Hand Pose Estimation | FreiHAND | PA-F@5mm | 0.614 | YoutubeHand |
| Hand Pose Estimation | FreiHAND | PA-MPJPE | 8.4 | YoutubeHand |
| Hand Pose Estimation | FreiHAND | PA-MPVPE | 8.6 | YoutubeHand |
| 3D | FreiHAND | PA-F@15mm | 0.966 | YoutubeHand |
| 3D | FreiHAND | PA-F@5mm | 0.614 | YoutubeHand |
| 3D | FreiHAND | PA-MPJPE | 8.4 | YoutubeHand |
| 3D | FreiHAND | PA-MPVPE | 8.6 | YoutubeHand |
| 3D Hand Pose Estimation | FreiHAND | PA-F@15mm | 0.966 | YoutubeHand |
| 3D Hand Pose Estimation | FreiHAND | PA-F@5mm | 0.614 | YoutubeHand |
| 3D Hand Pose Estimation | FreiHAND | PA-MPJPE | 8.4 | YoutubeHand |
| 3D Hand Pose Estimation | FreiHAND | PA-MPVPE | 8.6 | YoutubeHand |
| 1 Image, 2*2 Stitchi | FreiHAND | PA-F@15mm | 0.966 | YoutubeHand |
| 1 Image, 2*2 Stitchi | FreiHAND | PA-F@5mm | 0.614 | YoutubeHand |
| 1 Image, 2*2 Stitchi | FreiHAND | PA-MPJPE | 8.4 | YoutubeHand |
| 1 Image, 2*2 Stitchi | FreiHAND | PA-MPVPE | 8.6 | YoutubeHand |