TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Cascaded deep monocular 3D human pose estimation with evol...

Cascaded deep monocular 3D human pose estimation with evolutionary training data

Shichao Li, Lei Ke, Kevin Pratama, Yu-Wing Tai, Chi-Keung Tang, Kwang-Ting Cheng

2020-06-14CVPR 2020 63D Human Pose EstimationWeakly-supervised 3D Human Pose EstimationRepresentation LearningMonocular 3D Human Pose EstimationData AugmentationPose Estimation
PaperPDFCode(official)

Abstract

End-to-end deep representation learning has achieved remarkable accuracy for monocular 3D human pose estimation, yet these models may fail for unseen poses with limited and fixed training data. This paper proposes a novel data augmentation method that: (1) is scalable for synthesizing massive amount of training data (over 8 million valid 3D human poses with corresponding 2D projections) for training 2D-to-3D networks, (2) can effectively reduce dataset bias. Our method evolves a limited dataset to synthesize unseen 3D human skeletons based on a hierarchical human representation and heuristics inspired by prior knowledge. Extensive experiments show that our approach not only achieves state-of-the-art accuracy on the largest public benchmark, but also generalizes significantly better to unseen and rare poses. Code, pre-trained models and tools are available at this HTTPS URL.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationMPI-INF-3DHPAUC46.1EvoSkeleton
3D Human Pose EstimationMPI-INF-3DHPMPJPE99.7EvoSkeleton
3D Human Pose EstimationMPI-INF-3DHPPCK81.2EvoSkeleton
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)50.9TAG-Net
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)50.9TAG-Net
3D Human Pose EstimationHuman3.6MFrames Needed1TAG-Net
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)62.9Li et al.
Pose EstimationMPI-INF-3DHPAUC46.1EvoSkeleton
Pose EstimationMPI-INF-3DHPMPJPE99.7EvoSkeleton
Pose EstimationMPI-INF-3DHPPCK81.2EvoSkeleton
Pose EstimationHuman3.6MAverage MPJPE (mm)50.9TAG-Net
Pose EstimationHuman3.6MAverage MPJPE (mm)50.9TAG-Net
Pose EstimationHuman3.6MFrames Needed1TAG-Net
Pose EstimationHuman3.6MAverage MPJPE (mm)62.9Li et al.
3DMPI-INF-3DHPAUC46.1EvoSkeleton
3DMPI-INF-3DHPMPJPE99.7EvoSkeleton
3DMPI-INF-3DHPPCK81.2EvoSkeleton
3DHuman3.6MAverage MPJPE (mm)50.9TAG-Net
3DHuman3.6MAverage MPJPE (mm)50.9TAG-Net
3DHuman3.6MFrames Needed1TAG-Net
3DHuman3.6MAverage MPJPE (mm)62.9Li et al.
1 Image, 2*2 StitchiMPI-INF-3DHPAUC46.1EvoSkeleton
1 Image, 2*2 StitchiMPI-INF-3DHPMPJPE99.7EvoSkeleton
1 Image, 2*2 StitchiMPI-INF-3DHPPCK81.2EvoSkeleton
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)50.9TAG-Net
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)50.9TAG-Net
1 Image, 2*2 StitchiHuman3.6MFrames Needed1TAG-Net
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)62.9Li et al.

Related Papers

Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper2025-07-20Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Boosting Team Modeling through Tempo-Relational Representation Learning2025-07-17Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17