CLIFF: Carrying Location Information in Full Frames into Human Pose and Shape Estimation

Zhihao LI, Jianzhuang Liu, Zhensong Zhang, Songcen Xu, Youliang Yan

2022-08-013D Human Pose Estimation Human Detection Unsupervised 3D Human Pose Estimation Human Mesh Recovery 3D human pose and shape estimation

Paper PDF Code Code Code Code Code(official)Code

Abstract

Top-down methods dominate the field of 3D human pose and shape estimation, because they are decoupled from human detection and allow researchers to focus on the core problem. However, cropping, their first step, discards the location information from the very beginning, which makes themselves unable to accurately predict the global rotation in the original camera coordinate system. To address this problem, we propose to Carry Location Information in Full Frames (CLIFF) into this task. Specifically, we feed more holistic features to CLIFF by concatenating the cropped-image feature with its bounding box information. We calculate the 2D reprojection loss with a broader view of the full frame, taking a projection process similar to that of the person projected in the image. Fed and supervised by global-location-aware information, CLIFF directly predicts the global rotation along with more accurate articulated poses. Besides, we propose a pseudo-ground-truth annotator based on CLIFF, which provides high-quality 3D annotations for in-the-wild 2D datasets and offers crucial full supervision for regression-based methods. Extensive experiments on popular benchmarks show that CLIFF outperforms prior arts by a significant margin, and reaches the first place on the AGORA leaderboard (the SMPL-Algorithms track). The code and data are available at https://github.com/huawei-noah/noah-research/tree/master/CLIFF.

Results

Task	Dataset	Metric	Value	Model
3D Human Pose Estimation	EMDB	Average MPJAE (deg)	23.0933	CLIFF
3D Human Pose Estimation	EMDB	Average MPJAE-PA (deg)	21.6265	CLIFF
3D Human Pose Estimation	EMDB	Average MPJPE (mm)	103.134	CLIFF
3D Human Pose Estimation	EMDB	Average MPJPE-PA (mm)	68.7969	CLIFF
3D Human Pose Estimation	EMDB	Average MVE (mm)	122.884	CLIFF
3D Human Pose Estimation	EMDB	Average MVE-PA (mm)	81.3275	CLIFF
3D Human Pose Estimation	EMDB	Jitter (10m/s^3)	55.4525	CLIFF
3D Reconstruction	Human3.6M	PA-MPJPE	32.7	CLIFF (HR-W48)
Pose Estimation	EMDB	Average MPJAE (deg)	23.0933	CLIFF
Pose Estimation	EMDB	Average MPJAE-PA (deg)	21.6265	CLIFF
Pose Estimation	EMDB	Average MPJPE (mm)	103.134	CLIFF
Pose Estimation	EMDB	Average MPJPE-PA (mm)	68.7969	CLIFF
Pose Estimation	EMDB	Average MVE (mm)	122.884	CLIFF
Pose Estimation	EMDB	Average MVE-PA (mm)	81.3275	CLIFF
Pose Estimation	EMDB	Jitter (10m/s^3)	55.4525	CLIFF
3D	EMDB	Average MPJAE (deg)	23.0933	CLIFF
3D	EMDB	Average MPJAE-PA (deg)	21.6265	CLIFF
3D	EMDB	Average MPJPE (mm)	103.134	CLIFF
3D	EMDB	Average MPJPE-PA (mm)	68.7969	CLIFF
3D	EMDB	Average MVE (mm)	122.884	CLIFF
3D	EMDB	Average MVE-PA (mm)	81.3275	CLIFF
3D	EMDB	Jitter (10m/s^3)	55.4525	CLIFF
3D	Human3.6M	PA-MPJPE	32.7	CLIFF (HR-W48)
Human Mesh Recovery	BEDLAM	PVE-All	87.6	BEDLAM-CLIFF+
Human Mesh Recovery	BEDLAM	PVE-All	94.6	BEDLAM-CLIFF
1 Image, 2*2 Stitchi	EMDB	Average MPJAE (deg)	23.0933	CLIFF
1 Image, 2*2 Stitchi	EMDB	Average MPJAE-PA (deg)	21.6265	CLIFF
1 Image, 2*2 Stitchi	EMDB	Average MPJPE (mm)	103.134	CLIFF
1 Image, 2*2 Stitchi	EMDB	Average MPJPE-PA (mm)	68.7969	CLIFF
1 Image, 2*2 Stitchi	EMDB	Average MVE (mm)	122.884	CLIFF
1 Image, 2*2 Stitchi	EMDB	Average MVE-PA (mm)	81.3275	CLIFF
1 Image, 2*2 Stitchi	EMDB	Jitter (10m/s^3)	55.4525	CLIFF

Abstract

Results

Task	Dataset	Metric	Value	Model
3D Human Pose Estimation	EMDB	Average MPJAE (deg)	23.0933	CLIFF
3D Human Pose Estimation	EMDB	Average MPJAE-PA (deg)	21.6265	CLIFF
3D Human Pose Estimation	EMDB	Average MPJPE (mm)	103.134	CLIFF
3D Human Pose Estimation	EMDB	Average MPJPE-PA (mm)	68.7969	CLIFF
3D Human Pose Estimation	EMDB	Average MVE (mm)	122.884	CLIFF
3D Human Pose Estimation	EMDB	Average MVE-PA (mm)	81.3275	CLIFF
3D Human Pose Estimation	EMDB	Jitter (10m/s^3)	55.4525	CLIFF
3D Reconstruction	Human3.6M	PA-MPJPE	32.7	CLIFF (HR-W48)
Pose Estimation	EMDB	Average MPJAE (deg)	23.0933	CLIFF
Pose Estimation	EMDB	Average MPJAE-PA (deg)	21.6265	CLIFF
Pose Estimation	EMDB	Average MPJPE (mm)	103.134	CLIFF
Pose Estimation	EMDB	Average MPJPE-PA (mm)	68.7969	CLIFF
Pose Estimation	EMDB	Average MVE (mm)	122.884	CLIFF
Pose Estimation	EMDB	Average MVE-PA (mm)	81.3275	CLIFF
Pose Estimation	EMDB	Jitter (10m/s^3)	55.4525	CLIFF
3D	EMDB	Average MPJAE (deg)	23.0933	CLIFF
3D	EMDB	Average MPJAE-PA (deg)	21.6265	CLIFF
3D	EMDB	Average MPJPE (mm)	103.134	CLIFF
3D	EMDB	Average MPJPE-PA (mm)	68.7969	CLIFF
3D	EMDB	Average MVE (mm)	122.884	CLIFF
3D	EMDB	Average MVE-PA (mm)	81.3275	CLIFF
3D	EMDB	Jitter (10m/s^3)	55.4525	CLIFF
3D	Human3.6M	PA-MPJPE	32.7	CLIFF (HR-W48)
Human Mesh Recovery	BEDLAM	PVE-All	87.6	BEDLAM-CLIFF+
Human Mesh Recovery	BEDLAM	PVE-All	94.6	BEDLAM-CLIFF
1 Image, 2*2 Stitchi	EMDB	Average MPJAE (deg)	23.0933	CLIFF
1 Image, 2*2 Stitchi	EMDB	Average MPJAE-PA (deg)	21.6265	CLIFF
1 Image, 2*2 Stitchi	EMDB	Average MPJPE (mm)	103.134	CLIFF
1 Image, 2*2 Stitchi	EMDB	Average MPJPE-PA (mm)	68.7969	CLIFF
1 Image, 2*2 Stitchi	EMDB	Average MVE (mm)	122.884	CLIFF
1 Image, 2*2 Stitchi	EMDB	Average MVE-PA (mm)	81.3275	CLIFF
1 Image, 2*2 Stitchi	EMDB	Jitter (10m/s^3)	55.4525	CLIFF

CLIFF: Carrying Location Information in Full Frames into Human Pose and Shape Estimation

Abstract

Results

Related Papers

CLIFF: Carrying Location Information in Full Frames into Human Pose and Shape Estimation

Abstract

Results

Related Papers