TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Sampling is Matter: Point-guided 3D Human Mesh Reconstruct...

Sampling is Matter: Point-guided 3D Human Mesh Reconstruction

Jeonghwan Kim, Mi-Gyeong Gwon, Hyunwoo Park, Hyukmin Kwon, Gi-Mun Um, Wonjun Kim

2023-04-19CVPR 2023 13D Human Pose Estimation3D Hand Pose EstimationMonocular 3D Human Pose Estimation
PaperPDFCode(official)

Abstract

This paper presents a simple yet powerful method for 3D human mesh reconstruction from a single RGB image. Most recently, the non-local interactions of the whole mesh vertices have been effectively estimated in the transformer while the relationship between body parts also has begun to be handled via the graph model. Even though those approaches have shown the remarkable progress in 3D human mesh reconstruction, it is still difficult to directly infer the relationship between features, which are encoded from the 2D input image, and 3D coordinates of each vertex. To resolve this problem, we propose to design a simple feature sampling scheme. The key idea is to sample features in the embedded space by following the guide of points, which are estimated as projection results of 3D mesh vertices (i.e., ground truth). This helps the model to concentrate more on vertex-relevant features in the 2D space, thus leading to the reconstruction of the natural human pose. Furthermore, we apply progressive attention masking to precisely estimate local interactions between vertices even under severe occlusions. Experimental results on benchmark datasets show that the proposed method efficiently improves the performance of 3D human mesh reconstruction. The code and model are publicly available at: https://github.com/DCVL-3D/PointHMR_release.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)48.3PointHMR
3D Human Pose EstimationHuman3.6MPA-MPJPE32.9PointHMR
HandFreiHANDPA-F@15mm0.984PointHMR
HandFreiHANDPA-F@5mm0.72PointHMR
HandFreiHANDPA-MPJPE6.1PointHMR
HandFreiHANDPA-MPVPE6.6PointHMR
Pose EstimationHuman3.6MAverage MPJPE (mm)48.3PointHMR
Pose EstimationHuman3.6MPA-MPJPE32.9PointHMR
Pose EstimationFreiHANDPA-F@15mm0.984PointHMR
Pose EstimationFreiHANDPA-F@5mm0.72PointHMR
Pose EstimationFreiHANDPA-MPJPE6.1PointHMR
Pose EstimationFreiHANDPA-MPVPE6.6PointHMR
Hand Pose EstimationFreiHANDPA-F@15mm0.984PointHMR
Hand Pose EstimationFreiHANDPA-F@5mm0.72PointHMR
Hand Pose EstimationFreiHANDPA-MPJPE6.1PointHMR
Hand Pose EstimationFreiHANDPA-MPVPE6.6PointHMR
3DHuman3.6MAverage MPJPE (mm)48.3PointHMR
3DHuman3.6MPA-MPJPE32.9PointHMR
3DFreiHANDPA-F@15mm0.984PointHMR
3DFreiHANDPA-F@5mm0.72PointHMR
3DFreiHANDPA-MPJPE6.1PointHMR
3DFreiHANDPA-MPVPE6.6PointHMR
3D Hand Pose EstimationFreiHANDPA-F@15mm0.984PointHMR
3D Hand Pose EstimationFreiHANDPA-F@5mm0.72PointHMR
3D Hand Pose EstimationFreiHANDPA-MPJPE6.1PointHMR
3D Hand Pose EstimationFreiHANDPA-MPVPE6.6PointHMR
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)48.3PointHMR
1 Image, 2*2 StitchiHuman3.6MPA-MPJPE32.9PointHMR
1 Image, 2*2 StitchiFreiHANDPA-F@15mm0.984PointHMR
1 Image, 2*2 StitchiFreiHANDPA-F@5mm0.72PointHMR
1 Image, 2*2 StitchiFreiHANDPA-MPJPE6.1PointHMR
1 Image, 2*2 StitchiFreiHANDPA-MPVPE6.6PointHMR

Related Papers

Systematic Comparison of Projection Methods for Monocular 3D Human Pose Estimation on Fisheye Images2025-06-24ExtPose: Robust and Coherent Pose Estimation by Extending ViTs2025-06-18PoseGRAF: Geometric-Reinforced Adaptive Fusion for Monocular 3D Human Pose Estimation2025-06-17Monocular 3D Hand Pose Estimation with Implicit Camera Alignment2025-06-10Learning Pyramid-structured Long-range Dependencies for 3D Human Pose Estimation2025-06-03UPTor: Unified 3D Human Pose Dynamics and Trajectory Prediction for Human-Robot Interaction2025-05-20PoseBench3D: A Cross-Dataset Analysis Framework for 3D Human Pose Estimation2025-05-16HDiffTG: A Lightweight Hybrid Diffusion-Transformer-GCN Architecture for 3D Human Pose Estimation2025-05-07