TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Expressive Body Capture: 3D Hands, Face, and Body from a S...

Expressive Body Capture: 3D Hands, Face, and Body from a Single Image

Georgios Pavlakos, Vasileios Choutas, Nima Ghorbani, Timo Bolkart, Ahmed A. A. Osman, Dimitrios Tzionas, Michael J. Black

2019-04-11CVPR 2019 63D Human Pose Estimation3D Reconstruction3D Human Reconstruction3D Multi-Person Mesh Recovery
PaperPDFCode(official)

Abstract

To facilitate the analysis of human actions, interactions and emotions, we compute a 3D model of human body pose, hand pose, and facial expression from a single monocular image. To achieve this, we use thousands of 3D scans to train a new, unified, 3D model of the human body, SMPL-X, that extends SMPL with fully articulated hands and an expressive face. Learning to regress the parameters of SMPL-X directly from images is challenging without paired images and 3D ground truth. Consequently, we follow the approach of SMPLify, which estimates 2D features and then optimizes model parameters to fit the features. We improve on SMPLify in several significant ways: (1) we detect 2D features corresponding to the face, hands, and feet and fit the full SMPL-X model to these; (2) we train a new neural network pose prior using a large MoCap dataset; (3) we define a new interpenetration penalty that is both fast and accurate; (4) we automatically detect gender and the appropriate body models (male, female, or neutral); (5) our PyTorch implementation achieves a speedup of more than 8x over Chumpy. We use the new method, SMPLify-X, to fit SMPL-X to both controlled images and images in the wild. We evaluate 3D accuracy on a new curated dataset comprising 100 images with pseudo ground-truth. This is a step towards automatic expressive human capture from monocular RGB data. The models, code, and data are available for research purposes at https://smpl-x.is.tue.mpg.de.

Results

TaskDatasetMetricValueModel
ReconstructionExpressive hands and faces dataset (EHF)MPJPE, left hand12.2SMPLify-X
ReconstructionExpressive hands and faces dataset (EHF)MPJPE-1487.6SMPLify-X
ReconstructionExpressive hands and faces dataset (EHF)PA V2V (mm), body only75.4SMPLify-X
ReconstructionExpressive hands and faces dataset (EHF)PA V2V (mm), face4.9SMPLify-X
ReconstructionExpressive hands and faces dataset (EHF)PA V2V (mm), left hand11.6SMPLify-X
ReconstructionExpressive hands and faces dataset (EHF)TR V2V (mm), body only116.1SMPLify-X
ReconstructionExpressive hands and faces dataset (EHF)TR V2V (mm), face11.5SMPLify-X
ReconstructionExpressive hands and faces dataset (EHF)TR V2V (mm), left hand23.8SMPLify-X
ReconstructionExpressive hands and faces dataset (EHF)TR V2V (mm), whole body93SMPLify-X
ReconstructionExpressive hands and faces dataset (EHF)mean P2S36.8SMPLify-X
ReconstructionExpressive hands and faces dataset (EHF)median P2S23SMPLify-X
ReconstructionAGORAB-MPJPE182.1SMPLify-X
ReconstructionAGORAB-MVE187SMPLify-X
ReconstructionAGORAB-NMJE256.5SMPLify-X
ReconstructionAGORAB-NMVE263.3SMPLify-X
ReconstructionAGORAF-MPJPE52.9SMPLify-X
ReconstructionAGORAF-MVE48.9SMPLify-X
ReconstructionAGORAFB-MPJPE231.8SMPLify-X
ReconstructionAGORAFB-MVE236.5SMPLify-X
ReconstructionAGORAFB-NMJE326.5SMPLify-X
ReconstructionAGORAFB-NMVE333.1SMPLify-X
3D Human Pose EstimationAGORAB-MPJPE182.1SMPLify-X
3D Human Pose EstimationAGORAB-MVE187SMPLify-X
3D Human Pose EstimationAGORAB-NMJE256.5SMPLify-X
3D Human Pose EstimationAGORAB-NMVE263.3SMPLify-X
3D Human Pose EstimationAGORAF-MPJPE52.9SMPLify-X
3D Human Pose EstimationAGORAF-MVE48.9SMPLify-X
3D Human Pose EstimationAGORAFB-MPJPE231.8SMPLify-X
3D Human Pose EstimationAGORAFB-MVE236.5SMPLify-X
3D Human Pose EstimationAGORAFB-NMJE326.5SMPLify-X
3D Human Pose EstimationAGORAFB-NMVE333.1SMPLify-X
Emotion RecognitionExpressive hands and faces dataset (EHF).v2v error52.9SMPLify-X
Pose EstimationAGORAB-MPJPE182.1SMPLify-X
Pose EstimationAGORAB-MVE187SMPLify-X
Pose EstimationAGORAB-NMJE256.5SMPLify-X
Pose EstimationAGORAB-NMVE263.3SMPLify-X
Pose EstimationAGORAF-MPJPE52.9SMPLify-X
Pose EstimationAGORAF-MVE48.9SMPLify-X
Pose EstimationAGORAFB-MPJPE231.8SMPLify-X
Pose EstimationAGORAFB-MVE236.5SMPLify-X
Pose EstimationAGORAFB-NMJE326.5SMPLify-X
Pose EstimationAGORAFB-NMVE333.1SMPLify-X
3DAGORAB-MPJPE182.1SMPLify-X
3DAGORAB-MVE187SMPLify-X
3DAGORAB-NMJE256.5SMPLify-X
3DAGORAB-NMVE263.3SMPLify-X
3DAGORAF-MPJPE52.9SMPLify-X
3DAGORAF-MVE48.9SMPLify-X
3DAGORAFB-MPJPE231.8SMPLify-X
3DAGORAFB-MVE236.5SMPLify-X
3DAGORAFB-NMJE326.5SMPLify-X
3DAGORAFB-NMVE333.1SMPLify-X
3D Multi-Person Pose EstimationAGORAB-MPJPE182.1SMPLify-X
3D Multi-Person Pose EstimationAGORAB-MVE187SMPLify-X
3D Multi-Person Pose EstimationAGORAB-NMJE256.5SMPLify-X
3D Multi-Person Pose EstimationAGORAB-NMVE263.3SMPLify-X
3D Multi-Person Pose EstimationAGORAF-MPJPE52.9SMPLify-X
3D Multi-Person Pose EstimationAGORAF-MVE48.9SMPLify-X
3D Multi-Person Pose EstimationAGORAFB-MPJPE231.8SMPLify-X
3D Multi-Person Pose EstimationAGORAFB-MVE236.5SMPLify-X
3D Multi-Person Pose EstimationAGORAFB-NMJE326.5SMPLify-X
3D Multi-Person Pose EstimationAGORAFB-NMVE333.1SMPLify-X
Multimodal Emotion RecognitionExpressive hands and faces dataset (EHF).v2v error52.9SMPLify-X
1 Image, 2*2 StitchiAGORAB-MPJPE182.1SMPLify-X
1 Image, 2*2 StitchiAGORAB-MVE187SMPLify-X
1 Image, 2*2 StitchiAGORAB-NMJE256.5SMPLify-X
1 Image, 2*2 StitchiAGORAB-NMVE263.3SMPLify-X
1 Image, 2*2 StitchiAGORAF-MPJPE52.9SMPLify-X
1 Image, 2*2 StitchiAGORAF-MVE48.9SMPLify-X
1 Image, 2*2 StitchiAGORAFB-MPJPE231.8SMPLify-X
1 Image, 2*2 StitchiAGORAFB-MVE236.5SMPLify-X
1 Image, 2*2 StitchiAGORAFB-NMJE326.5SMPLify-X
1 Image, 2*2 StitchiAGORAFB-NMVE333.1SMPLify-X

Related Papers

AutoPartGen: Autogressive 3D Part Generation and Discovery2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images2025-07-16Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation2025-07-15Binomial Self-Compensation: Mechanism and Suppression of Motion Error in Phase-Shifting Profilometry2025-07-14An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan2025-07-11Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT2025-07-11DreamGrasp: Zero-Shot 3D Multi-Object Reconstruction from Partial-View Images for Robotic Manipulation2025-07-08