TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/CLIFF: Carrying Location Information in Full Frames into H...

CLIFF: Carrying Location Information in Full Frames into Human Pose and Shape Estimation

Zhihao LI, Jianzhuang Liu, Zhensong Zhang, Songcen Xu, Youliang Yan

2022-08-013D Human Pose EstimationHuman DetectionUnsupervised 3D Human Pose EstimationHuman Mesh Recovery3D human pose and shape estimation
PaperPDFCodeCodeCodeCodeCode(official)Code

Abstract

Top-down methods dominate the field of 3D human pose and shape estimation, because they are decoupled from human detection and allow researchers to focus on the core problem. However, cropping, their first step, discards the location information from the very beginning, which makes themselves unable to accurately predict the global rotation in the original camera coordinate system. To address this problem, we propose to Carry Location Information in Full Frames (CLIFF) into this task. Specifically, we feed more holistic features to CLIFF by concatenating the cropped-image feature with its bounding box information. We calculate the 2D reprojection loss with a broader view of the full frame, taking a projection process similar to that of the person projected in the image. Fed and supervised by global-location-aware information, CLIFF directly predicts the global rotation along with more accurate articulated poses. Besides, we propose a pseudo-ground-truth annotator based on CLIFF, which provides high-quality 3D annotations for in-the-wild 2D datasets and offers crucial full supervision for regression-based methods. Extensive experiments on popular benchmarks show that CLIFF outperforms prior arts by a significant margin, and reaches the first place on the AGORA leaderboard (the SMPL-Algorithms track). The code and data are available at https://github.com/huawei-noah/noah-research/tree/master/CLIFF.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationEMDBAverage MPJAE (deg)23.0933CLIFF
3D Human Pose EstimationEMDBAverage MPJAE-PA (deg)21.6265CLIFF
3D Human Pose EstimationEMDBAverage MPJPE (mm)103.134CLIFF
3D Human Pose EstimationEMDBAverage MPJPE-PA (mm)68.7969CLIFF
3D Human Pose EstimationEMDBAverage MVE (mm)122.884CLIFF
3D Human Pose EstimationEMDBAverage MVE-PA (mm)81.3275CLIFF
3D Human Pose EstimationEMDBJitter (10m/s^3)55.4525CLIFF
3D ReconstructionHuman3.6MPA-MPJPE32.7CLIFF (HR-W48)
Pose EstimationEMDBAverage MPJAE (deg)23.0933CLIFF
Pose EstimationEMDBAverage MPJAE-PA (deg)21.6265CLIFF
Pose EstimationEMDBAverage MPJPE (mm)103.134CLIFF
Pose EstimationEMDBAverage MPJPE-PA (mm)68.7969CLIFF
Pose EstimationEMDBAverage MVE (mm)122.884CLIFF
Pose EstimationEMDBAverage MVE-PA (mm)81.3275CLIFF
Pose EstimationEMDBJitter (10m/s^3)55.4525CLIFF
3DEMDBAverage MPJAE (deg)23.0933CLIFF
3DEMDBAverage MPJAE-PA (deg)21.6265CLIFF
3DEMDBAverage MPJPE (mm)103.134CLIFF
3DEMDBAverage MPJPE-PA (mm)68.7969CLIFF
3DEMDBAverage MVE (mm)122.884CLIFF
3DEMDBAverage MVE-PA (mm)81.3275CLIFF
3DEMDBJitter (10m/s^3)55.4525CLIFF
3DHuman3.6MPA-MPJPE32.7CLIFF (HR-W48)
Human Mesh RecoveryBEDLAMPVE-All87.6BEDLAM-CLIFF+
Human Mesh RecoveryBEDLAMPVE-All94.6BEDLAM-CLIFF
1 Image, 2*2 StitchiEMDBAverage MPJAE (deg)23.0933CLIFF
1 Image, 2*2 StitchiEMDBAverage MPJAE-PA (deg)21.6265CLIFF
1 Image, 2*2 StitchiEMDBAverage MPJPE (mm)103.134CLIFF
1 Image, 2*2 StitchiEMDBAverage MPJPE-PA (mm)68.7969CLIFF
1 Image, 2*2 StitchiEMDBAverage MVE (mm)122.884CLIFF
1 Image, 2*2 StitchiEMDBAverage MVE-PA (mm)81.3275CLIFF
1 Image, 2*2 StitchiEMDBJitter (10m/s^3)55.4525CLIFF

Related Papers

Systematic Comparison of Projection Methods for Monocular 3D Human Pose Estimation on Fisheye Images2025-06-24ExtPose: Robust and Coherent Pose Estimation by Extending ViTs2025-06-18PoseGRAF: Geometric-Reinforced Adaptive Fusion for Monocular 3D Human Pose Estimation2025-06-17MetricHMR: Metric Human Mesh Recovery from Monocular Images2025-06-11Learning Pyramid-structured Long-range Dependencies for 3D Human Pose Estimation2025-06-03TrojanStego: Your Language Model Can Secretly Be A Steganographic Privacy Leaking Agent2025-05-26R3GS: Gaussian Splatting for Robust Reconstruction and Relocalization in Unconstrained Image Collections2025-05-21UPTor: Unified 3D Human Pose Dynamics and Trajectory Prediction for Human-Robot Interaction2025-05-20