Expressive Body Capture: 3D Hands, Face, and Body from a Single Image

Georgios Pavlakos, Vasileios Choutas, Nima Ghorbani, Timo Bolkart, Ahmed A. A. Osman, Dimitrios Tzionas, Michael J. Black

2019-04-11CVPR 2019 63D Human Pose Estimation 3D Reconstruction 3D Human Reconstruction 3D Multi-Person Mesh Recovery

Paper PDF Code(official)

Abstract

To facilitate the analysis of human actions, interactions and emotions, we compute a 3D model of human body pose, hand pose, and facial expression from a single monocular image. To achieve this, we use thousands of 3D scans to train a new, unified, 3D model of the human body, SMPL-X, that extends SMPL with fully articulated hands and an expressive face. Learning to regress the parameters of SMPL-X directly from images is challenging without paired images and 3D ground truth. Consequently, we follow the approach of SMPLify, which estimates 2D features and then optimizes model parameters to fit the features. We improve on SMPLify in several significant ways: (1) we detect 2D features corresponding to the face, hands, and feet and fit the full SMPL-X model to these; (2) we train a new neural network pose prior using a large MoCap dataset; (3) we define a new interpenetration penalty that is both fast and accurate; (4) we automatically detect gender and the appropriate body models (male, female, or neutral); (5) our PyTorch implementation achieves a speedup of more than 8x over Chumpy. We use the new method, SMPLify-X, to fit SMPL-X to both controlled images and images in the wild. We evaluate 3D accuracy on a new curated dataset comprising 100 images with pseudo ground-truth. This is a step towards automatic expressive human capture from monocular RGB data. The models, code, and data are available for research purposes at https://smpl-x.is.tue.mpg.de.

Results

Task	Dataset	Metric	Value	Model
Reconstruction	Expressive hands and faces dataset (EHF)	MPJPE, left hand	12.2	SMPLify-X
Reconstruction	Expressive hands and faces dataset (EHF)	MPJPE-14	87.6	SMPLify-X
Reconstruction	Expressive hands and faces dataset (EHF)	PA V2V (mm), body only	75.4	SMPLify-X
Reconstruction	Expressive hands and faces dataset (EHF)	PA V2V (mm), face	4.9	SMPLify-X
Reconstruction	Expressive hands and faces dataset (EHF)	PA V2V (mm), left hand	11.6	SMPLify-X
Reconstruction	Expressive hands and faces dataset (EHF)	TR V2V (mm), body only	116.1	SMPLify-X
Reconstruction	Expressive hands and faces dataset (EHF)	TR V2V (mm), face	11.5	SMPLify-X
Reconstruction	Expressive hands and faces dataset (EHF)	TR V2V (mm), left hand	23.8	SMPLify-X
Reconstruction	Expressive hands and faces dataset (EHF)	TR V2V (mm), whole body	93	SMPLify-X
Reconstruction	Expressive hands and faces dataset (EHF)	mean P2S	36.8	SMPLify-X
Reconstruction	Expressive hands and faces dataset (EHF)	median P2S	23	SMPLify-X
Reconstruction	AGORA	B-MPJPE	182.1	SMPLify-X
Reconstruction	AGORA	B-MVE	187	SMPLify-X
Reconstruction	AGORA	B-NMJE	256.5	SMPLify-X
Reconstruction	AGORA	B-NMVE	263.3	SMPLify-X
Reconstruction	AGORA	F-MPJPE	52.9	SMPLify-X
Reconstruction	AGORA	F-MVE	48.9	SMPLify-X
Reconstruction	AGORA	FB-MPJPE	231.8	SMPLify-X
Reconstruction	AGORA	FB-MVE	236.5	SMPLify-X
Reconstruction	AGORA	FB-NMJE	326.5	SMPLify-X
Reconstruction	AGORA	FB-NMVE	333.1	SMPLify-X
3D Human Pose Estimation	AGORA	B-MPJPE	182.1	SMPLify-X
3D Human Pose Estimation	AGORA	B-MVE	187	SMPLify-X
3D Human Pose Estimation	AGORA	B-NMJE	256.5	SMPLify-X
3D Human Pose Estimation	AGORA	B-NMVE	263.3	SMPLify-X
3D Human Pose Estimation	AGORA	F-MPJPE	52.9	SMPLify-X
3D Human Pose Estimation	AGORA	F-MVE	48.9	SMPLify-X
3D Human Pose Estimation	AGORA	FB-MPJPE	231.8	SMPLify-X
3D Human Pose Estimation	AGORA	FB-MVE	236.5	SMPLify-X
3D Human Pose Estimation	AGORA	FB-NMJE	326.5	SMPLify-X
3D Human Pose Estimation	AGORA	FB-NMVE	333.1	SMPLify-X
Emotion Recognition	Expressive hands and faces dataset (EHF).	v2v error	52.9	SMPLify-X
Pose Estimation	AGORA	B-MPJPE	182.1	SMPLify-X
Pose Estimation	AGORA	B-MVE	187	SMPLify-X
Pose Estimation	AGORA	B-NMJE	256.5	SMPLify-X
Pose Estimation	AGORA	B-NMVE	263.3	SMPLify-X
Pose Estimation	AGORA	F-MPJPE	52.9	SMPLify-X
Pose Estimation	AGORA	F-MVE	48.9	SMPLify-X
Pose Estimation	AGORA	FB-MPJPE	231.8	SMPLify-X
Pose Estimation	AGORA	FB-MVE	236.5	SMPLify-X
Pose Estimation	AGORA	FB-NMJE	326.5	SMPLify-X
Pose Estimation	AGORA	FB-NMVE	333.1	SMPLify-X
3D	AGORA	B-MPJPE	182.1	SMPLify-X
3D	AGORA	B-MVE	187	SMPLify-X
3D	AGORA	B-NMJE	256.5	SMPLify-X
3D	AGORA	B-NMVE	263.3	SMPLify-X
3D	AGORA	F-MPJPE	52.9	SMPLify-X
3D	AGORA	F-MVE	48.9	SMPLify-X
3D	AGORA	FB-MPJPE	231.8	SMPLify-X
3D	AGORA	FB-MVE	236.5	SMPLify-X
3D	AGORA	FB-NMJE	326.5	SMPLify-X
3D	AGORA	FB-NMVE	333.1	SMPLify-X
3D Multi-Person Pose Estimation	AGORA	B-MPJPE	182.1	SMPLify-X
3D Multi-Person Pose Estimation	AGORA	B-MVE	187	SMPLify-X
3D Multi-Person Pose Estimation	AGORA	B-NMJE	256.5	SMPLify-X
3D Multi-Person Pose Estimation	AGORA	B-NMVE	263.3	SMPLify-X
3D Multi-Person Pose Estimation	AGORA	F-MPJPE	52.9	SMPLify-X
3D Multi-Person Pose Estimation	AGORA	F-MVE	48.9	SMPLify-X
3D Multi-Person Pose Estimation	AGORA	FB-MPJPE	231.8	SMPLify-X
3D Multi-Person Pose Estimation	AGORA	FB-MVE	236.5	SMPLify-X
3D Multi-Person Pose Estimation	AGORA	FB-NMJE	326.5	SMPLify-X
3D Multi-Person Pose Estimation	AGORA	FB-NMVE	333.1	SMPLify-X
Multimodal Emotion Recognition	Expressive hands and faces dataset (EHF).	v2v error	52.9	SMPLify-X
1 Image, 2*2 Stitchi	AGORA	B-MPJPE	182.1	SMPLify-X
1 Image, 2*2 Stitchi	AGORA	B-MVE	187	SMPLify-X
1 Image, 2*2 Stitchi	AGORA	B-NMJE	256.5	SMPLify-X
1 Image, 2*2 Stitchi	AGORA	B-NMVE	263.3	SMPLify-X
1 Image, 2*2 Stitchi	AGORA	F-MPJPE	52.9	SMPLify-X
1 Image, 2*2 Stitchi	AGORA	F-MVE	48.9	SMPLify-X
1 Image, 2*2 Stitchi	AGORA	FB-MPJPE	231.8	SMPLify-X
1 Image, 2*2 Stitchi	AGORA	FB-MVE	236.5	SMPLify-X
1 Image, 2*2 Stitchi	AGORA	FB-NMJE	326.5	SMPLify-X
1 Image, 2*2 Stitchi	AGORA	FB-NMVE	333.1	SMPLify-X

Abstract

Results

Task	Dataset	Metric	Value	Model
Reconstruction	Expressive hands and faces dataset (EHF)	MPJPE, left hand	12.2	SMPLify-X
Reconstruction	Expressive hands and faces dataset (EHF)	MPJPE-14	87.6	SMPLify-X
Reconstruction	Expressive hands and faces dataset (EHF)	PA V2V (mm), body only	75.4	SMPLify-X
Reconstruction	Expressive hands and faces dataset (EHF)	PA V2V (mm), face	4.9	SMPLify-X
Reconstruction	Expressive hands and faces dataset (EHF)	PA V2V (mm), left hand	11.6	SMPLify-X
Reconstruction	Expressive hands and faces dataset (EHF)	TR V2V (mm), body only	116.1	SMPLify-X
Reconstruction	Expressive hands and faces dataset (EHF)	TR V2V (mm), face	11.5	SMPLify-X
Reconstruction	Expressive hands and faces dataset (EHF)	TR V2V (mm), left hand	23.8	SMPLify-X
Reconstruction	Expressive hands and faces dataset (EHF)	TR V2V (mm), whole body	93	SMPLify-X
Reconstruction	Expressive hands and faces dataset (EHF)	mean P2S	36.8	SMPLify-X
Reconstruction	Expressive hands and faces dataset (EHF)	median P2S	23	SMPLify-X
Reconstruction	AGORA	B-MPJPE	182.1	SMPLify-X
Reconstruction	AGORA	B-MVE	187	SMPLify-X
Reconstruction	AGORA	B-NMJE	256.5	SMPLify-X
Reconstruction	AGORA	B-NMVE	263.3	SMPLify-X
Reconstruction	AGORA	F-MPJPE	52.9	SMPLify-X
Reconstruction	AGORA	F-MVE	48.9	SMPLify-X
Reconstruction	AGORA	FB-MPJPE	231.8	SMPLify-X
Reconstruction	AGORA	FB-MVE	236.5	SMPLify-X
Reconstruction	AGORA	FB-NMJE	326.5	SMPLify-X
Reconstruction	AGORA	FB-NMVE	333.1	SMPLify-X
3D Human Pose Estimation	AGORA	B-MPJPE	182.1	SMPLify-X
3D Human Pose Estimation	AGORA	B-MVE	187	SMPLify-X
3D Human Pose Estimation	AGORA	B-NMJE	256.5	SMPLify-X
3D Human Pose Estimation	AGORA	B-NMVE	263.3	SMPLify-X
3D Human Pose Estimation	AGORA	F-MPJPE	52.9	SMPLify-X
3D Human Pose Estimation	AGORA	F-MVE	48.9	SMPLify-X
3D Human Pose Estimation	AGORA	FB-MPJPE	231.8	SMPLify-X
3D Human Pose Estimation	AGORA	FB-MVE	236.5	SMPLify-X
3D Human Pose Estimation	AGORA	FB-NMJE	326.5	SMPLify-X
3D Human Pose Estimation	AGORA	FB-NMVE	333.1	SMPLify-X
Emotion Recognition	Expressive hands and faces dataset (EHF).	v2v error	52.9	SMPLify-X
Pose Estimation	AGORA	B-MPJPE	182.1	SMPLify-X
Pose Estimation	AGORA	B-MVE	187	SMPLify-X
Pose Estimation	AGORA	B-NMJE	256.5	SMPLify-X
Pose Estimation	AGORA	B-NMVE	263.3	SMPLify-X
Pose Estimation	AGORA	F-MPJPE	52.9	SMPLify-X
Pose Estimation	AGORA	F-MVE	48.9	SMPLify-X
Pose Estimation	AGORA	FB-MPJPE	231.8	SMPLify-X
Pose Estimation	AGORA	FB-MVE	236.5	SMPLify-X
Pose Estimation	AGORA	FB-NMJE	326.5	SMPLify-X
Pose Estimation	AGORA	FB-NMVE	333.1	SMPLify-X
3D	AGORA	B-MPJPE	182.1	SMPLify-X
3D	AGORA	B-MVE	187	SMPLify-X
3D	AGORA	B-NMJE	256.5	SMPLify-X
3D	AGORA	B-NMVE	263.3	SMPLify-X
3D	AGORA	F-MPJPE	52.9	SMPLify-X
3D	AGORA	F-MVE	48.9	SMPLify-X
3D	AGORA	FB-MPJPE	231.8	SMPLify-X
3D	AGORA	FB-MVE	236.5	SMPLify-X
3D	AGORA	FB-NMJE	326.5	SMPLify-X
3D	AGORA	FB-NMVE	333.1	SMPLify-X
3D Multi-Person Pose Estimation	AGORA	B-MPJPE	182.1	SMPLify-X
3D Multi-Person Pose Estimation	AGORA	B-MVE	187	SMPLify-X
3D Multi-Person Pose Estimation	AGORA	B-NMJE	256.5	SMPLify-X
3D Multi-Person Pose Estimation	AGORA	B-NMVE	263.3	SMPLify-X
3D Multi-Person Pose Estimation	AGORA	F-MPJPE	52.9	SMPLify-X
3D Multi-Person Pose Estimation	AGORA	F-MVE	48.9	SMPLify-X
3D Multi-Person Pose Estimation	AGORA	FB-MPJPE	231.8	SMPLify-X
3D Multi-Person Pose Estimation	AGORA	FB-MVE	236.5	SMPLify-X
3D Multi-Person Pose Estimation	AGORA	FB-NMJE	326.5	SMPLify-X
3D Multi-Person Pose Estimation	AGORA	FB-NMVE	333.1	SMPLify-X
Multimodal Emotion Recognition	Expressive hands and faces dataset (EHF).	v2v error	52.9	SMPLify-X
1 Image, 2*2 Stitchi	AGORA	B-MPJPE	182.1	SMPLify-X
1 Image, 2*2 Stitchi	AGORA	B-MVE	187	SMPLify-X
1 Image, 2*2 Stitchi	AGORA	B-NMJE	256.5	SMPLify-X
1 Image, 2*2 Stitchi	AGORA	B-NMVE	263.3	SMPLify-X
1 Image, 2*2 Stitchi	AGORA	F-MPJPE	52.9	SMPLify-X
1 Image, 2*2 Stitchi	AGORA	F-MVE	48.9	SMPLify-X
1 Image, 2*2 Stitchi	AGORA	FB-MPJPE	231.8	SMPLify-X
1 Image, 2*2 Stitchi	AGORA	FB-MVE	236.5	SMPLify-X
1 Image, 2*2 Stitchi	AGORA	FB-NMJE	326.5	SMPLify-X
1 Image, 2*2 Stitchi	AGORA	FB-NMVE	333.1	SMPLify-X

Expressive Body Capture: 3D Hands, Face, and Body from a Single Image

Abstract

Results

Related Papers

Expressive Body Capture: 3D Hands, Face, and Body from a Single Image

Abstract

Results

Related Papers