TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/3D WholeBody Pose Estimation based on Semantic Graph Atten...

3D WholeBody Pose Estimation based on Semantic Graph Attention Network and Distance Information

Sihan Wen, Xiantan Zhu, Zhiming Tan

2024-06-03Pose Estimation3D Pose EstimationGraph Attention
PaperPDF

Abstract

In recent years, a plethora of diverse methods have been proposed for 3D pose estimation. Among these, self-attention mechanisms and graph convolutions have both been proven to be effective and practical methods. Recognizing the strengths of those two techniques, we have developed a novel Semantic Graph Attention Network which can benefit from the ability of self-attention to capture global context, while also utilizing the graph convolutions to handle the local connectivity and structural constraints of the skeleton. We also design a Body Part Decoder that assists in extracting and refining the information related to specific segments of the body. Furthermore, our approach incorporates Distance Information, enhancing our model's capability to comprehend and accurately predict spatial relationships. Finally, we introduce a Geometry Loss who makes a critical constraint on the structural skeleton of the body, ensuring that the model's predictions adhere to the natural limits of human posture. The experimental results validate the effectiveness of our approach, demonstrating that every element within the system is essential for improving pose estimation outcomes. With comparison to state-of-the-art, the proposed work not only meets but exceeds the existing benchmarks.

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingH3WBAverage MPJPE (mm)15.95SemGAN
3D Human Pose EstimationH3WBMPJPE45.39SemGAN
HandH3WBAverage MPJPE (mm)27.77SemGAN
Pose EstimationH3WBMPJPE45.39SemGAN
Pose EstimationH3WBAverage MPJPE (mm)27.77SemGAN
Hand Pose EstimationH3WBAverage MPJPE (mm)27.77SemGAN
Facial Landmark DetectionH3WBAverage MPJPE (mm)15.95SemGAN
Face ReconstructionH3WBAverage MPJPE (mm)15.95SemGAN
3DH3WBMPJPE45.39SemGAN
3DH3WBAverage MPJPE (mm)27.77SemGAN
3DH3WBAverage MPJPE (mm)15.95SemGAN
3D Face ModellingH3WBAverage MPJPE (mm)15.95SemGAN
3D Face ReconstructionH3WBAverage MPJPE (mm)15.95SemGAN
3D Hand Pose EstimationH3WBAverage MPJPE (mm)27.77SemGAN
1 Image, 2*2 StitchiH3WBMPJPE45.39SemGAN
1 Image, 2*2 StitchiH3WBAverage MPJPE (mm)27.77SemGAN

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16