TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/WiLoR: End-to-end 3D Hand Localization and Reconstruction ...

WiLoR: End-to-end 3D Hand Localization and Reconstruction in-the-wild

Rolandos Alexandros Potamias, Jinglei Zhang, Jiankang Deng, Stefanos Zafeiriou

2024-09-18CVPR 2025 13D Hand Pose EstimationPose EstimationHand DetectionHand Pose Estimation
PaperPDFCode(official)

Abstract

In recent years, 3D hand pose estimation methods have garnered significant attention due to their extensive applications in human-computer interaction, virtual reality, and robotics. In contrast, there has been a notable gap in hand detection pipelines, posing significant challenges in constructing effective real-world multi-hand reconstruction systems. In this work, we present a data-driven pipeline for efficient multi-hand reconstruction in the wild. The proposed pipeline is composed of two components: a real-time fully convolutional hand localization and a high-fidelity transformer-based 3D hand reconstruction model. To tackle the limitations of previous methods and build a robust and stable detection network, we introduce a large-scale dataset with over than 2M in-the-wild hand images with diverse lighting, illumination, and occlusion conditions. Our approach outperforms previous methods in both efficiency and accuracy on popular 2D and 3D benchmarks. Finally, we showcase the effectiveness of our pipeline to achieve smooth 3D hand tracking from monocular videos, without utilizing any temporal components. Code, models, and dataset are available https://rolpotamias.github.io/WiLoR.

Results

TaskDatasetMetricValueModel
HandFreiHANDPA-F@15mm0.993WiLoR
HandFreiHANDPA-F@5mm0.825WiLoR
HandFreiHANDPA-MPJPE5.5WiLoR
HandFreiHANDPA-MPVPE5.1WiLoR
HandHO-3D v2AUC_J0.851WiLoR
HandHO-3D v2AUC_V0.846WiLoR
HandHO-3D v2F@15mm0.983WiLoR
HandHO-3D v2F@5mm0.646WiLoR
HandHO-3D v2PA-MPJPE (mm)7.5WiLoR
HandHO-3D v2PA-MPVPE7.7WiLoR
Pose EstimationFreiHANDPA-F@15mm0.993WiLoR
Pose EstimationFreiHANDPA-F@5mm0.825WiLoR
Pose EstimationFreiHANDPA-MPJPE5.5WiLoR
Pose EstimationFreiHANDPA-MPVPE5.1WiLoR
Pose EstimationHO-3D v2AUC_J0.851WiLoR
Pose EstimationHO-3D v2AUC_V0.846WiLoR
Pose EstimationHO-3D v2F@15mm0.983WiLoR
Pose EstimationHO-3D v2F@5mm0.646WiLoR
Pose EstimationHO-3D v2PA-MPJPE (mm)7.5WiLoR
Pose EstimationHO-3D v2PA-MPVPE7.7WiLoR
Hand Pose EstimationFreiHANDPA-F@15mm0.993WiLoR
Hand Pose EstimationFreiHANDPA-F@5mm0.825WiLoR
Hand Pose EstimationFreiHANDPA-MPJPE5.5WiLoR
Hand Pose EstimationFreiHANDPA-MPVPE5.1WiLoR
Hand Pose EstimationHO-3D v2AUC_J0.851WiLoR
Hand Pose EstimationHO-3D v2AUC_V0.846WiLoR
Hand Pose EstimationHO-3D v2F@15mm0.983WiLoR
Hand Pose EstimationHO-3D v2F@5mm0.646WiLoR
Hand Pose EstimationHO-3D v2PA-MPJPE (mm)7.5WiLoR
Hand Pose EstimationHO-3D v2PA-MPVPE7.7WiLoR
3DFreiHANDPA-F@15mm0.993WiLoR
3DFreiHANDPA-F@5mm0.825WiLoR
3DFreiHANDPA-MPJPE5.5WiLoR
3DFreiHANDPA-MPVPE5.1WiLoR
3DHO-3D v2AUC_J0.851WiLoR
3DHO-3D v2AUC_V0.846WiLoR
3DHO-3D v2F@15mm0.983WiLoR
3DHO-3D v2F@5mm0.646WiLoR
3DHO-3D v2PA-MPJPE (mm)7.5WiLoR
3DHO-3D v2PA-MPVPE7.7WiLoR
3D Hand Pose EstimationFreiHANDPA-F@15mm0.993WiLoR
3D Hand Pose EstimationFreiHANDPA-F@5mm0.825WiLoR
3D Hand Pose EstimationFreiHANDPA-MPJPE5.5WiLoR
3D Hand Pose EstimationFreiHANDPA-MPVPE5.1WiLoR
3D Hand Pose EstimationHO-3D v2AUC_J0.851WiLoR
3D Hand Pose EstimationHO-3D v2AUC_V0.846WiLoR
3D Hand Pose EstimationHO-3D v2F@15mm0.983WiLoR
3D Hand Pose EstimationHO-3D v2F@5mm0.646WiLoR
3D Hand Pose EstimationHO-3D v2PA-MPJPE (mm)7.5WiLoR
3D Hand Pose EstimationHO-3D v2PA-MPVPE7.7WiLoR
1 Image, 2*2 StitchiFreiHANDPA-F@15mm0.993WiLoR
1 Image, 2*2 StitchiFreiHANDPA-F@5mm0.825WiLoR
1 Image, 2*2 StitchiFreiHANDPA-MPJPE5.5WiLoR
1 Image, 2*2 StitchiFreiHANDPA-MPVPE5.1WiLoR
1 Image, 2*2 StitchiHO-3D v2AUC_J0.851WiLoR
1 Image, 2*2 StitchiHO-3D v2AUC_V0.846WiLoR
1 Image, 2*2 StitchiHO-3D v2F@15mm0.983WiLoR
1 Image, 2*2 StitchiHO-3D v2F@5mm0.646WiLoR
1 Image, 2*2 StitchiHO-3D v2PA-MPJPE (mm)7.5WiLoR
1 Image, 2*2 StitchiHO-3D v2PA-MPVPE7.7WiLoR

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16