Deep Two-Stream Video Inference for Human Body Pose and Shape Estimation

Ziwen Li, Bo Xu, Han Huang, Cheng Lu, Yandong Guo

2021-10-223D Human Pose Estimation Optical Flow Estimation

Abstract

Several video-based 3D pose and shape estimation algorithms have been proposed to resolve the temporal inconsistency of single-image-based methods. However it still remains challenging to have stable and accurate reconstruction. In this paper, we propose a new framework Deep Two-Stream Video Inference for Human Body Pose and Shape Estimation (DTS-VIBE), to generate 3D human pose and mesh from RGB videos. We reformulate the task as a multi-modality problem that fuses RGB and optical flow for more reliable estimation. In order to fully utilize both sensory modalities (RGB or optical flow), we train a two-stream temporal network based on transformer to predict SMPL parameters. The supplementary modality, optical flow, helps to maintain temporal consistency by leveraging motion knowledge between two consecutive frames. The proposed algorithm is extensively evaluated on the Human3.6 and 3DPW datasets. The experimental results show that it outperforms other state-of-the-art methods by a significant margin.

Results

Task	Dataset	Metric	Value	Model
3D Human Pose Estimation	MPI-INF-3DHP	Acceleration Error	11.9	DST-VIBE
3D Human Pose Estimation	MPI-INF-3DHP	MPJPE	93.4	DST-VIBE
3D Human Pose Estimation	MPI-INF-3DHP	PA-MPJPE	62.2	DST-VIBE
3D Human Pose Estimation	3DPW	Acceleration Error	11	DST-VIBE
3D Human Pose Estimation	3DPW	MPJPE	76.7	DST-VIBE
3D Human Pose Estimation	3DPW	MPVPE	93.5	DST-VIBE
3D Human Pose Estimation	3DPW	PA-MPJPE	50.3	DST-VIBE
Pose Estimation	MPI-INF-3DHP	Acceleration Error	11.9	DST-VIBE
Pose Estimation	MPI-INF-3DHP	MPJPE	93.4	DST-VIBE
Pose Estimation	MPI-INF-3DHP	PA-MPJPE	62.2	DST-VIBE
Pose Estimation	3DPW	Acceleration Error	11	DST-VIBE
Pose Estimation	3DPW	MPJPE	76.7	DST-VIBE
Pose Estimation	3DPW	MPVPE	93.5	DST-VIBE
Pose Estimation	3DPW	PA-MPJPE	50.3	DST-VIBE
3D	MPI-INF-3DHP	Acceleration Error	11.9	DST-VIBE
3D	MPI-INF-3DHP	MPJPE	93.4	DST-VIBE
3D	MPI-INF-3DHP	PA-MPJPE	62.2	DST-VIBE
3D	3DPW	Acceleration Error	11	DST-VIBE
3D	3DPW	MPJPE	76.7	DST-VIBE
3D	3DPW	MPVPE	93.5	DST-VIBE
3D	3DPW	PA-MPJPE	50.3	DST-VIBE
1 Image, 2*2 Stitchi	MPI-INF-3DHP	Acceleration Error	11.9	DST-VIBE
1 Image, 2*2 Stitchi	MPI-INF-3DHP	MPJPE	93.4	DST-VIBE
1 Image, 2*2 Stitchi	MPI-INF-3DHP	PA-MPJPE	62.2	DST-VIBE
1 Image, 2*2 Stitchi	3DPW	Acceleration Error	11	DST-VIBE
1 Image, 2*2 Stitchi	3DPW	MPJPE	76.7	DST-VIBE
1 Image, 2*2 Stitchi	3DPW	MPVPE	93.5	DST-VIBE
1 Image, 2*2 Stitchi	3DPW	PA-MPJPE	50.3	DST-VIBE

Deep Two-Stream Video Inference for Human Body Pose and Shape Estimation

Abstract

Results

Related Papers

Deep Two-Stream Video Inference for Human Body Pose and Shape Estimation

Abstract

Results

Related Papers