TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/HandOS: 3D Hand Reconstruction in One Stage

HandOS: 3D Hand Reconstruction in One Stage

Xingyu Chen, Zhuheng Song, Xiaoke Jiang, Yaoqing Hu, Junzhi Yu, Lei Zhang

2024-12-02CVPR 2025 13D Hand Pose EstimationPose EstimationHand Detection2D Pose EstimationKeypoint Estimation
PaperPDF

Abstract

Existing approaches of hand reconstruction predominantly adhere to a multi-stage framework, encompassing detection, left-right classification, and pose estimation. This paradigm induces redundant computation and cumulative errors. In this work, we propose HandOS, an end-to-end framework for 3D hand reconstruction. Our central motivation lies in leveraging a frozen detector as the foundation while incorporating auxiliary modules for 2D and 3D keypoint estimation. In this manner, we integrate the pose estimation capacity into the detection framework, while at the same time obviating the necessity of using the left-right category as a prerequisite. Specifically, we propose an interactive 2D-3D decoder, where 2D joint semantics is derived from detection cues while 3D representation is lifted from those of 2D joints. Furthermore, hierarchical attention is designed to enable the concurrent modeling of 2D joints, 3D vertices, and camera translation. Consequently, we achieve an end-to-end integration of hand detection, 2D pose estimation, and 3D mesh reconstruction within a one-stage framework, so that the above multi-stage drawbacks are overcome. Meanwhile, the HandOS reaches state-of-the-art performances on public benchmarks, e.g., 5.0 PA-MPJPE on FreiHand and 64.6\% PCK@0.05 on HInt-Ego4D. Project page: idea-research.github.io/HandOSweb.

Results

TaskDatasetMetricValueModel
HandFreiHANDPA-F@15mm0.991HandOS
HandFreiHANDPA-F@5mm0.812HandOS
HandFreiHANDPA-MPJPE5HandOS
HandFreiHANDPA-MPVPE5.3HandOS
Pose EstimationFreiHANDPA-F@15mm0.991HandOS
Pose EstimationFreiHANDPA-F@5mm0.812HandOS
Pose EstimationFreiHANDPA-MPJPE5HandOS
Pose EstimationFreiHANDPA-MPVPE5.3HandOS
Hand Pose EstimationFreiHANDPA-F@15mm0.991HandOS
Hand Pose EstimationFreiHANDPA-F@5mm0.812HandOS
Hand Pose EstimationFreiHANDPA-MPJPE5HandOS
Hand Pose EstimationFreiHANDPA-MPVPE5.3HandOS
3DFreiHANDPA-F@15mm0.991HandOS
3DFreiHANDPA-F@5mm0.812HandOS
3DFreiHANDPA-MPJPE5HandOS
3DFreiHANDPA-MPVPE5.3HandOS
3D Hand Pose EstimationFreiHANDPA-F@15mm0.991HandOS
3D Hand Pose EstimationFreiHANDPA-F@5mm0.812HandOS
3D Hand Pose EstimationFreiHANDPA-MPJPE5HandOS
3D Hand Pose EstimationFreiHANDPA-MPVPE5.3HandOS
1 Image, 2*2 StitchiFreiHANDPA-F@15mm0.991HandOS
1 Image, 2*2 StitchiFreiHANDPA-F@5mm0.812HandOS
1 Image, 2*2 StitchiFreiHANDPA-MPJPE5HandOS
1 Image, 2*2 StitchiFreiHANDPA-MPVPE5.3HandOS

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16