TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/PoseAug: A Differentiable Pose Augmentation Framework for ...

PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose Estimation

Kehong Gong, Jianfeng Zhang, Jiashi Feng

2021-05-06CVPR 2021 13D Human Pose EstimationWeakly-supervised 3D Human Pose EstimationMonocular 3D Human Pose EstimationData AugmentationPose Estimation
PaperPDFCode(official)

Abstract

Existing 3D human pose estimators suffer poor generalization performance to new datasets, largely due to the limited diversity of 2D-3D pose pairs in the training data. To address this problem, we present PoseAug, a new auto-augmentation framework that learns to augment the available training poses towards a greater diversity and thus improve generalization of the trained 2D-to-3D pose estimator. Specifically, PoseAug introduces a novel pose augmentor that learns to adjust various geometry factors (e.g., posture, body size, view point and position) of a pose through differentiable operations. With such differentiable capacity, the augmentor can be jointly optimized with the 3D pose estimator and take the estimation error as feedback to generate more diverse and harder poses in an online manner. Moreover, PoseAug introduces a novel part-aware Kinematic Chain Space for evaluating local joint-angle plausibility and develops a discriminative module accordingly to ensure the plausibility of the augmented poses. These elaborate designs enable PoseAug to generate more diverse yet plausible poses than existing offline augmentation methods, and thus yield better generalization of the pose estimator. PoseAug is generic and easy to be applied to various 3D pose estimators. Extensive experiments demonstrate that PoseAug brings clear improvements on both intra-scenario and cross-scenario datasets. Notably, it achieves 88.6% 3D PCK on MPI-INF-3DHP under cross-dataset evaluation setup, improving upon the previous best data augmentation based method by 9.1%. Code can be found at: https://github.com/jfzhang95/PoseAug.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationMPI-INF-3DHPAUC57.9PoseAug (+Extra2D)
3D Human Pose EstimationMPI-INF-3DHPMPJPE71.1PoseAug (+Extra2D)
3D Human Pose EstimationMPI-INF-3DHPPCK89.2PoseAug (+Extra2D)
3D Human Pose EstimationMPI-INF-3DHPAUC57.3VPose+PoseAug
3D Human Pose EstimationMPI-INF-3DHPMPJPE73VPose+PoseAug
3D Human Pose EstimationMPI-INF-3DHPPCK88.6VPose+PoseAug
3D Human Pose EstimationMPI-INF-3DHPMPJPE73.2HR-Net+VPose+PoseAug
3D Human Pose EstimationMPI-INF-3DHPMPJPE76.6HR-Net+ST-GCN+PoseAug
3D Human Pose Estimation3DPWPA-MPJPE73.2HR-Net+ST-GCN+PoseAug
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)50.2HR-Net+VPose+PoseAug
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)50.8HR-Net+ST-GCN+PoseAug
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)50.2HR-Net+VPose+PoseAug
3D Human Pose EstimationHuman3.6MPA-MPJPE39.1HR-Net+VPose+PoseAug
3D Human Pose EstimationHuman3.6MFrames Needed1PoseAug
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)56.7PoseAug
3D Human Pose EstimationHuman3.6MNumber of Frames Per View1PoseAug
3D Human Pose EstimationHuman3.6MNumber of Views1PoseAug
3D Human Pose EstimationHuman3.6MNumber of Frames Per View1PoseAug
Pose EstimationMPI-INF-3DHPAUC57.9PoseAug (+Extra2D)
Pose EstimationMPI-INF-3DHPMPJPE71.1PoseAug (+Extra2D)
Pose EstimationMPI-INF-3DHPPCK89.2PoseAug (+Extra2D)
Pose EstimationMPI-INF-3DHPAUC57.3VPose+PoseAug
Pose EstimationMPI-INF-3DHPMPJPE73VPose+PoseAug
Pose EstimationMPI-INF-3DHPPCK88.6VPose+PoseAug
Pose EstimationMPI-INF-3DHPMPJPE73.2HR-Net+VPose+PoseAug
Pose EstimationMPI-INF-3DHPMPJPE76.6HR-Net+ST-GCN+PoseAug
Pose Estimation3DPWPA-MPJPE73.2HR-Net+ST-GCN+PoseAug
Pose EstimationHuman3.6MAverage MPJPE (mm)50.2HR-Net+VPose+PoseAug
Pose EstimationHuman3.6MAverage MPJPE (mm)50.8HR-Net+ST-GCN+PoseAug
Pose EstimationHuman3.6MAverage MPJPE (mm)50.2HR-Net+VPose+PoseAug
Pose EstimationHuman3.6MPA-MPJPE39.1HR-Net+VPose+PoseAug
Pose EstimationHuman3.6MFrames Needed1PoseAug
Pose EstimationHuman3.6MAverage MPJPE (mm)56.7PoseAug
Pose EstimationHuman3.6MNumber of Frames Per View1PoseAug
Pose EstimationHuman3.6MNumber of Views1PoseAug
Pose EstimationHuman3.6MNumber of Frames Per View1PoseAug
3DMPI-INF-3DHPAUC57.9PoseAug (+Extra2D)
3DMPI-INF-3DHPMPJPE71.1PoseAug (+Extra2D)
3DMPI-INF-3DHPPCK89.2PoseAug (+Extra2D)
3DMPI-INF-3DHPAUC57.3VPose+PoseAug
3DMPI-INF-3DHPMPJPE73VPose+PoseAug
3DMPI-INF-3DHPPCK88.6VPose+PoseAug
3DMPI-INF-3DHPMPJPE73.2HR-Net+VPose+PoseAug
3DMPI-INF-3DHPMPJPE76.6HR-Net+ST-GCN+PoseAug
3D3DPWPA-MPJPE73.2HR-Net+ST-GCN+PoseAug
3DHuman3.6MAverage MPJPE (mm)50.2HR-Net+VPose+PoseAug
3DHuman3.6MAverage MPJPE (mm)50.8HR-Net+ST-GCN+PoseAug
3DHuman3.6MAverage MPJPE (mm)50.2HR-Net+VPose+PoseAug
3DHuman3.6MPA-MPJPE39.1HR-Net+VPose+PoseAug
3DHuman3.6MFrames Needed1PoseAug
3DHuman3.6MAverage MPJPE (mm)56.7PoseAug
3DHuman3.6MNumber of Frames Per View1PoseAug
3DHuman3.6MNumber of Views1PoseAug
3DHuman3.6MNumber of Frames Per View1PoseAug
1 Image, 2*2 StitchiMPI-INF-3DHPAUC57.9PoseAug (+Extra2D)
1 Image, 2*2 StitchiMPI-INF-3DHPMPJPE71.1PoseAug (+Extra2D)
1 Image, 2*2 StitchiMPI-INF-3DHPPCK89.2PoseAug (+Extra2D)
1 Image, 2*2 StitchiMPI-INF-3DHPAUC57.3VPose+PoseAug
1 Image, 2*2 StitchiMPI-INF-3DHPMPJPE73VPose+PoseAug
1 Image, 2*2 StitchiMPI-INF-3DHPPCK88.6VPose+PoseAug
1 Image, 2*2 StitchiMPI-INF-3DHPMPJPE73.2HR-Net+VPose+PoseAug
1 Image, 2*2 StitchiMPI-INF-3DHPMPJPE76.6HR-Net+ST-GCN+PoseAug
1 Image, 2*2 Stitchi3DPWPA-MPJPE73.2HR-Net+ST-GCN+PoseAug
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)50.2HR-Net+VPose+PoseAug
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)50.8HR-Net+ST-GCN+PoseAug
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)50.2HR-Net+VPose+PoseAug
1 Image, 2*2 StitchiHuman3.6MPA-MPJPE39.1HR-Net+VPose+PoseAug
1 Image, 2*2 StitchiHuman3.6MFrames Needed1PoseAug
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)56.7PoseAug
1 Image, 2*2 StitchiHuman3.6MNumber of Frames Per View1PoseAug
1 Image, 2*2 StitchiHuman3.6MNumber of Views1PoseAug
1 Image, 2*2 StitchiHuman3.6MNumber of Frames Per View1PoseAug

Related Papers

Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16