TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Learning to Reconstruct 3D Human Pose and Shape via Model-...

Learning to Reconstruct 3D Human Pose and Shape via Model-fitting in the Loop

Nikos Kolotouros, Georgios Pavlakos, Michael J. Black, Kostas Daniilidis

2019-09-27ICCV 2019 103D Human Pose Estimation3D Human Shape EstimationPose Estimation3D Multi-Person Pose Estimation
PaperPDFCode

Abstract

Model-based human pose estimation is currently approached through two different paradigms. Optimization-based methods fit a parametric body model to 2D observations in an iterative manner, leading to accurate image-model alignments, but are often slow and sensitive to the initialization. In contrast, regression-based methods, that use a deep network to directly estimate the model parameters from pixels, tend to provide reasonable, but not pixel accurate, results while requiring huge amounts of supervision. In this work, instead of investigating which approach is better, our key insight is that the two paradigms can form a strong collaboration. A reasonable, directly regressed estimate from the network can initialize the iterative optimization making the fitting faster and more accurate. Similarly, a pixel accurate fit from iterative optimization can act as strong supervision for the network. This is the core of our proposed approach SPIN (SMPL oPtimization IN the loop). The deep network initializes an iterative optimization routine that fits the body model to 2D joints within the training loop, and the fitted estimate is subsequently used to supervise the network. Our approach is self-improving by nature, since better network estimates can lead the optimization to better solutions, while more accurate optimization fits provide better supervision for the network. We demonstrate the effectiveness of our approach in different settings, where 3D ground truth is scarce, or not available, and we consistently outperform the state-of-the-art model-based pose estimation approaches by significant margins. The project website with videos, results, and code can be found at https://seas.upenn.edu/~nkolot/projects/spin.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationAGORAB-MPJPE175.1SPIN
3D Human Pose EstimationAGORAB-MVE168.7SPIN
3D Human Pose EstimationAGORAB-NMJE223.1SPIN
3D Human Pose EstimationAGORAB-NMVE216.3SPIN
3D Human Pose EstimationMPI-INF-3DHPAUC37.1SPIN
3D Human Pose EstimationMPI-INF-3DHPMPJPE105.2SPIN
3D Human Pose EstimationMPI-INF-3DHPPCK76.4SPIN
3D Human Pose EstimationMPI-INF-3DHPAUC55.6SPIN (Rigid Alignment)
3D Human Pose EstimationMPI-INF-3DHPPA-MPJPE67.5SPIN (Rigid Alignment)
3D Human Pose EstimationMPI-INF-3DHPPCK92.5SPIN (Rigid Alignment)
3D Human Pose Estimation3D Poses in the Wild ChallengeMPJAE25.42SPIN
3D Human Pose Estimation3D Poses in the Wild ChallengeMPJPE102.56SPIN
3D Human Pose Estimation3DPWMPJPE96.9SPIN
3D Human Pose Estimation3DPWMPVPE116.4SPIN
3D Human Pose Estimation3DPWPA-MPJPE59.2SPIN
3D Human Pose EstimationAGORAB-MPJPE175.1SPIN
3D Human Pose EstimationAGORAB-MVE168.7SPIN
3D Human Pose EstimationAGORAB-NMJE223.1SPIN
3D Human Pose EstimationAGORAB-NMVE216.3SPIN
3D Human Pose EstimationSSP-3DPVE-T-SC22.2SPIN
3D Human Pose EstimationSSP-3DmIOU70SPIN
Pose EstimationAGORAB-MPJPE175.1SPIN
Pose EstimationAGORAB-MVE168.7SPIN
Pose EstimationAGORAB-NMJE223.1SPIN
Pose EstimationAGORAB-NMVE216.3SPIN
Pose EstimationMPI-INF-3DHPAUC37.1SPIN
Pose EstimationMPI-INF-3DHPMPJPE105.2SPIN
Pose EstimationMPI-INF-3DHPPCK76.4SPIN
Pose EstimationMPI-INF-3DHPAUC55.6SPIN (Rigid Alignment)
Pose EstimationMPI-INF-3DHPPA-MPJPE67.5SPIN (Rigid Alignment)
Pose EstimationMPI-INF-3DHPPCK92.5SPIN (Rigid Alignment)
Pose Estimation3D Poses in the Wild ChallengeMPJAE25.42SPIN
Pose Estimation3D Poses in the Wild ChallengeMPJPE102.56SPIN
Pose Estimation3DPWMPJPE96.9SPIN
Pose Estimation3DPWMPVPE116.4SPIN
Pose Estimation3DPWPA-MPJPE59.2SPIN
Pose EstimationAGORAB-MPJPE175.1SPIN
Pose EstimationAGORAB-MVE168.7SPIN
Pose EstimationAGORAB-NMJE223.1SPIN
Pose EstimationAGORAB-NMVE216.3SPIN
Pose EstimationSSP-3DPVE-T-SC22.2SPIN
Pose EstimationSSP-3DmIOU70SPIN
3DAGORAB-MPJPE175.1SPIN
3DAGORAB-MVE168.7SPIN
3DAGORAB-NMJE223.1SPIN
3DAGORAB-NMVE216.3SPIN
3DMPI-INF-3DHPAUC37.1SPIN
3DMPI-INF-3DHPMPJPE105.2SPIN
3DMPI-INF-3DHPPCK76.4SPIN
3DMPI-INF-3DHPAUC55.6SPIN (Rigid Alignment)
3DMPI-INF-3DHPPA-MPJPE67.5SPIN (Rigid Alignment)
3DMPI-INF-3DHPPCK92.5SPIN (Rigid Alignment)
3D3D Poses in the Wild ChallengeMPJAE25.42SPIN
3D3D Poses in the Wild ChallengeMPJPE102.56SPIN
3D3DPWMPJPE96.9SPIN
3D3DPWMPVPE116.4SPIN
3D3DPWPA-MPJPE59.2SPIN
3DAGORAB-MPJPE175.1SPIN
3DAGORAB-MVE168.7SPIN
3DAGORAB-NMJE223.1SPIN
3DAGORAB-NMVE216.3SPIN
3DSSP-3DPVE-T-SC22.2SPIN
3DSSP-3DmIOU70SPIN
3D Multi-Person Pose EstimationAGORAB-MPJPE175.1SPIN
3D Multi-Person Pose EstimationAGORAB-MVE168.7SPIN
3D Multi-Person Pose EstimationAGORAB-NMJE223.1SPIN
3D Multi-Person Pose EstimationAGORAB-NMVE216.3SPIN
3D Absolute Human Pose EstimationSSP-3DPVE-T-SC22.2SPIN
3D Absolute Human Pose EstimationSSP-3DmIOU70SPIN
1 Image, 2*2 StitchiAGORAB-MPJPE175.1SPIN
1 Image, 2*2 StitchiAGORAB-MVE168.7SPIN
1 Image, 2*2 StitchiAGORAB-NMJE223.1SPIN
1 Image, 2*2 StitchiAGORAB-NMVE216.3SPIN
1 Image, 2*2 StitchiMPI-INF-3DHPAUC37.1SPIN
1 Image, 2*2 StitchiMPI-INF-3DHPMPJPE105.2SPIN
1 Image, 2*2 StitchiMPI-INF-3DHPPCK76.4SPIN
1 Image, 2*2 StitchiMPI-INF-3DHPAUC55.6SPIN (Rigid Alignment)
1 Image, 2*2 StitchiMPI-INF-3DHPPA-MPJPE67.5SPIN (Rigid Alignment)
1 Image, 2*2 StitchiMPI-INF-3DHPPCK92.5SPIN (Rigid Alignment)
1 Image, 2*2 Stitchi3D Poses in the Wild ChallengeMPJAE25.42SPIN
1 Image, 2*2 Stitchi3D Poses in the Wild ChallengeMPJPE102.56SPIN
1 Image, 2*2 Stitchi3DPWMPJPE96.9SPIN
1 Image, 2*2 Stitchi3DPWMPVPE116.4SPIN
1 Image, 2*2 Stitchi3DPWPA-MPJPE59.2SPIN
1 Image, 2*2 StitchiAGORAB-MPJPE175.1SPIN
1 Image, 2*2 StitchiAGORAB-MVE168.7SPIN
1 Image, 2*2 StitchiAGORAB-NMJE223.1SPIN
1 Image, 2*2 StitchiAGORAB-NMVE216.3SPIN
1 Image, 2*2 StitchiSSP-3DPVE-T-SC22.2SPIN
1 Image, 2*2 StitchiSSP-3DmIOU70SPIN

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16