TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/PIFuHD: Multi-Level Pixel-Aligned Implicit Function for Hi...

PIFuHD: Multi-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization

Shunsuke Saito, Tomas Simon, Jason Saragih, Hanbyul Joo

2020-04-01CVPR 2020 63D Human Pose Estimation3D Human Shape Estimation3D Shape Reconstruction3D Human Reconstruction3D Object Reconstruction From A Single Image
PaperPDFCodeCode(official)Code

Abstract

Recent advances in image-based 3D human shape estimation have been driven by the significant improvement in representation power afforded by deep neural networks. Although current approaches have demonstrated the potential in real world settings, they still fail to produce reconstructions with the level of detail often present in the input images. We argue that this limitation stems primarily form two conflicting requirements; accurate predictions require large context, but precise predictions require high resolution. Due to memory limitations in current hardware, previous approaches tend to take low resolution images as input to cover large spatial context, and produce less precise (or low resolution) 3D estimates as a result. We address this limitation by formulating a multi-level architecture that is end-to-end trainable. A coarse level observes the whole image at lower resolution and focuses on holistic reasoning. This provides context to an fine level which estimates highly detailed geometry by observing higher-resolution images. We demonstrate that our approach significantly outperforms existing state-of-the-art techniques on single image human shape reconstruction by fully leveraging 1k-resolution input images.

Results

TaskDatasetMetricValueModel
ReconstructionCustomHumansChamfer Distance P-to-S2.107PIFuHD
ReconstructionCustomHumansChamfer Distance S-to-P2.228PIFuHD
ReconstructionCustomHumansNormal Consistency0.804PIFuHD
ReconstructionCustomHumansf-Score39.076PIFuHD
ReconstructionCAPEChamfer (cm)3.237PIFuHD
ReconstructionCAPENC0.112PIFuHD
ReconstructionCAPEP2S (cm)3.123PIFuHD
Reconstruction4D-DRESSChamfer (cm)2.393PIFuHD_Outer
Reconstruction4D-DRESSIoU0.743PIFuHD_Outer
Reconstruction4D-DRESSNormal Consistency0.763PIFuHD_Outer
Reconstruction4D-DRESSChamfer (cm)2.426PIFuHD_Inner
Reconstruction4D-DRESSIoU0.739PIFuHD_Inner
Reconstruction4D-DRESSNormal Consistency0.793PIFuHD_Inner
Object ReconstructionRenderPeopleChamfer (cm)1.525ML-PIFu (end-to-end)
Object ReconstructionBUFFChamfer (cm)1.525ML-PIFu (end-to-end)
Object ReconstructionBUFFPoint-to-surface distance (cm)0.25ML-PIFu (end-to-end)
Object ReconstructionBUFFSurface normal consistency0.22ML-PIFu (end-to-end)
Object ReconstructionBUFFChamfer (cm)1.73ML-PIFu (alternate)
Object ReconstructionBUFFPoint-to-surface distance (cm)1.63ML-PIFu (alternate)
Object ReconstructionBUFFSurface normal consistency0.133ML-PIFu (alternate)
3D Object ReconstructionRenderPeopleChamfer (cm)1.525ML-PIFu (end-to-end)
3D Object ReconstructionBUFFChamfer (cm)1.525ML-PIFu (end-to-end)
3D Object ReconstructionBUFFPoint-to-surface distance (cm)0.25ML-PIFu (end-to-end)
3D Object ReconstructionBUFFSurface normal consistency0.22ML-PIFu (end-to-end)
3D Object ReconstructionBUFFChamfer (cm)1.73ML-PIFu (alternate)
3D Object ReconstructionBUFFPoint-to-surface distance (cm)1.63ML-PIFu (alternate)
3D Object ReconstructionBUFFSurface normal consistency0.133ML-PIFu (alternate)

Related Papers

Systematic Comparison of Projection Methods for Monocular 3D Human Pose Estimation on Fisheye Images2025-06-24ExtPose: Robust and Coherent Pose Estimation by Extending ViTs2025-06-18PoseGRAF: Geometric-Reinforced Adaptive Fusion for Monocular 3D Human Pose Estimation2025-06-17PF-LHM: 3D Animatable Avatar Reconstruction from Pose-free Articulated Human Images2025-06-16SMPL Normal Map Is All You Need for Single-view Textured Human Reconstruction2025-06-15Learning Pyramid-structured Long-range Dependencies for 3D Human Pose Estimation2025-06-03HumanRAM: Feed-forward Human Reconstruction and Animation Model using Transformers2025-06-03UPTor: Unified 3D Human Pose Dynamics and Trajectory Prediction for Human-Robot Interaction2025-05-20