VIBE: Video Inference for Human Body Pose and Shape Estimation

Muhammed Kocabas, Nikos Athanasiou, Michael J. Black

2019-12-11CVPR 2020 63D Human Pose Estimation 3D Shape Reconstruction Monocular 3D Human Pose Estimation Pose Estimation 3D Pose Estimation

Paper PDF Code Code Code Code(official)Code

Abstract

Human motion is fundamental to understanding behavior. Despite progress on single-image 3D pose and shape estimation, existing video-based state-of-the-art methods fail to produce accurate and natural motion sequences due to a lack of ground-truth 3D motion data for training. To address this problem, we propose Video Inference for Body Pose and Shape Estimation (VIBE), which makes use of an existing large-scale motion capture dataset (AMASS) together with unpaired, in-the-wild, 2D keypoint annotations. Our key novelty is an adversarial learning framework that leverages AMASS to discriminate between real human motions and those produced by our temporal pose and shape regression networks. We define a temporal network architecture and show that adversarial training, at the sequence level, produces kinematically plausible motion sequences without in-the-wild ground-truth 3D labels. We perform extensive experimentation to analyze the importance of motion and demonstrate the effectiveness of VIBE on challenging 3D pose estimation datasets, achieving state-of-the-art performance. Code and pretrained models are available at https://github.com/mkocabas/VIBE.

Results

Task	Dataset	Metric	Value	Model
3D Human Pose Estimation	MPI-INF-3DHP	MPJPE	96.6	VIBE
3D Human Pose Estimation	MPI-INF-3DHP	PA-MPJPE	64.6	VIBE
3D Human Pose Estimation	MPI-INF-3DHP	PCK	89.3	VIBE
3D Human Pose Estimation	3DPW	Acceleration Error	23.4	VIBE
3D Human Pose Estimation	3DPW	MPJPE	82.9	VIBE
3D Human Pose Estimation	3DPW	MPVPE	99.1	VIBE
3D Human Pose Estimation	3DPW	Number of parameters (M)	72.43	VIBE
3D Human Pose Estimation	3DPW	PA-MPJPE	51.9	VIBE
3D Human Pose Estimation	Human3.6M	Average MPJPE (mm)	65.6	VIBE
3D Human Pose Estimation	Human3.6M	PA-MPJPE	41.4	VIBE
3D Human Pose Estimation	Human3.6M	Average MPJPE (mm)	65.6	VIBE
3D Human Pose Estimation	Human3.6M	Frames Needed	16	VIBE
Pose Estimation	MPI-INF-3DHP	MPJPE	96.6	VIBE
Pose Estimation	MPI-INF-3DHP	PA-MPJPE	64.6	VIBE
Pose Estimation	MPI-INF-3DHP	PCK	89.3	VIBE
Pose Estimation	3DPW	Acceleration Error	23.4	VIBE
Pose Estimation	3DPW	MPJPE	82.9	VIBE
Pose Estimation	3DPW	MPVPE	99.1	VIBE
Pose Estimation	3DPW	Number of parameters (M)	72.43	VIBE
Pose Estimation	3DPW	PA-MPJPE	51.9	VIBE
Pose Estimation	Human3.6M	Average MPJPE (mm)	65.6	VIBE
Pose Estimation	Human3.6M	PA-MPJPE	41.4	VIBE
Pose Estimation	Human3.6M	Average MPJPE (mm)	65.6	VIBE
Pose Estimation	Human3.6M	Frames Needed	16	VIBE
3D	MPI-INF-3DHP	MPJPE	96.6	VIBE
3D	MPI-INF-3DHP	PA-MPJPE	64.6	VIBE
3D	MPI-INF-3DHP	PCK	89.3	VIBE
3D	3DPW	Acceleration Error	23.4	VIBE
3D	3DPW	MPJPE	82.9	VIBE
3D	3DPW	MPVPE	99.1	VIBE
3D	3DPW	Number of parameters (M)	72.43	VIBE
3D	3DPW	PA-MPJPE	51.9	VIBE
3D	Human3.6M	Average MPJPE (mm)	65.6	VIBE
3D	Human3.6M	PA-MPJPE	41.4	VIBE
3D	Human3.6M	Average MPJPE (mm)	65.6	VIBE
3D	Human3.6M	Frames Needed	16	VIBE
1 Image, 2*2 Stitchi	MPI-INF-3DHP	MPJPE	96.6	VIBE
1 Image, 2*2 Stitchi	MPI-INF-3DHP	PA-MPJPE	64.6	VIBE
1 Image, 2*2 Stitchi	MPI-INF-3DHP	PCK	89.3	VIBE
1 Image, 2*2 Stitchi	3DPW	Acceleration Error	23.4	VIBE
1 Image, 2*2 Stitchi	3DPW	MPJPE	82.9	VIBE
1 Image, 2*2 Stitchi	3DPW	MPVPE	99.1	VIBE
1 Image, 2*2 Stitchi	3DPW	Number of parameters (M)	72.43	VIBE
1 Image, 2*2 Stitchi	3DPW	PA-MPJPE	51.9	VIBE
1 Image, 2*2 Stitchi	Human3.6M	Average MPJPE (mm)	65.6	VIBE
1 Image, 2*2 Stitchi	Human3.6M	PA-MPJPE	41.4	VIBE
1 Image, 2*2 Stitchi	Human3.6M	Average MPJPE (mm)	65.6	VIBE
1 Image, 2*2 Stitchi	Human3.6M	Frames Needed	16	VIBE

VIBE: Video Inference for Human Body Pose and Shape Estimation

Abstract

Results

Related Papers

VIBE: Video Inference for Human Body Pose and Shape Estimation

Abstract

Results

Related Papers