TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Leveraging MoCap Data for Human Mesh Recovery

Leveraging MoCap Data for Human Mesh Recovery

Fabien Baradel, Thibault Groueix, Philippe Weinzaepfel, Romain Brégier, Yannis Kalantidis, Grégory Rogez

2021-10-183D Human Pose Estimation3D Human Shape EstimationHuman Mesh Recovery3D Human Reconstruction
PaperPDFCode

Abstract

Training state-of-the-art models for human body pose and shape recovery from images or videos requires datasets with corresponding annotations that are really hard and expensive to obtain. Our goal in this paper is to study whether poses from 3D Motion Capture (MoCap) data can be used to improve image-based and video-based human mesh recovery methods. We find that fine-tune image-based models with synthetic renderings from MoCap data can increase their performance, by providing them with a wider variety of poses, textures and backgrounds. In fact, we show that simply fine-tuning the batch normalization layers of the model is enough to achieve large gains. We further study the use of MoCap data for video, and introduce PoseBERT, a transformer module that directly regresses the pose parameters and is trained via masked modeling. It is simple, generic and can be plugged on top of any state-of-the-art image-based model in order to transform it in a video-based model leveraging temporal information. Our experimental results show that the proposed approaches reach state-of-the-art performance on various datasets including 3DPW, MPI-INF-3DHP, MuPoTS-3D, MCB and AIST. Test code and models will be available soon.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationMPI-INF-3DHPAcceleration Error8.7MoCap-SPIN + PoseBERT
3D Human Pose EstimationMPI-INF-3DHPMPJPE97.4MoCap-SPIN + PoseBERT
3D Human Pose EstimationMPI-INF-3DHPPA-MPJPE63.3MoCap-SPIN + PoseBERT
3D Human Pose Estimation3DPWAcceleration Error8.3MoCap-SPIN + PoseBERT
3D Human Pose Estimation3DPWMPJPE89.4MoCap-SPIN + PoseBERT
3D Human Pose Estimation3DPWMPVPE103.8MoCap-SPIN + PoseBERT
3D Human Pose Estimation3DPWPA-MPJPE52.9MoCap-SPIN + PoseBERT
Pose EstimationMPI-INF-3DHPAcceleration Error8.7MoCap-SPIN + PoseBERT
Pose EstimationMPI-INF-3DHPMPJPE97.4MoCap-SPIN + PoseBERT
Pose EstimationMPI-INF-3DHPPA-MPJPE63.3MoCap-SPIN + PoseBERT
Pose Estimation3DPWAcceleration Error8.3MoCap-SPIN + PoseBERT
Pose Estimation3DPWMPJPE89.4MoCap-SPIN + PoseBERT
Pose Estimation3DPWMPVPE103.8MoCap-SPIN + PoseBERT
Pose Estimation3DPWPA-MPJPE52.9MoCap-SPIN + PoseBERT
3DMPI-INF-3DHPAcceleration Error8.7MoCap-SPIN + PoseBERT
3DMPI-INF-3DHPMPJPE97.4MoCap-SPIN + PoseBERT
3DMPI-INF-3DHPPA-MPJPE63.3MoCap-SPIN + PoseBERT
3D3DPWAcceleration Error8.3MoCap-SPIN + PoseBERT
3D3DPWMPJPE89.4MoCap-SPIN + PoseBERT
3D3DPWMPVPE103.8MoCap-SPIN + PoseBERT
3D3DPWPA-MPJPE52.9MoCap-SPIN + PoseBERT
1 Image, 2*2 StitchiMPI-INF-3DHPAcceleration Error8.7MoCap-SPIN + PoseBERT
1 Image, 2*2 StitchiMPI-INF-3DHPMPJPE97.4MoCap-SPIN + PoseBERT
1 Image, 2*2 StitchiMPI-INF-3DHPPA-MPJPE63.3MoCap-SPIN + PoseBERT
1 Image, 2*2 Stitchi3DPWAcceleration Error8.3MoCap-SPIN + PoseBERT
1 Image, 2*2 Stitchi3DPWMPJPE89.4MoCap-SPIN + PoseBERT
1 Image, 2*2 Stitchi3DPWMPVPE103.8MoCap-SPIN + PoseBERT
1 Image, 2*2 Stitchi3DPWPA-MPJPE52.9MoCap-SPIN + PoseBERT

Related Papers

Systematic Comparison of Projection Methods for Monocular 3D Human Pose Estimation on Fisheye Images2025-06-24ExtPose: Robust and Coherent Pose Estimation by Extending ViTs2025-06-18PoseGRAF: Geometric-Reinforced Adaptive Fusion for Monocular 3D Human Pose Estimation2025-06-17PF-LHM: 3D Animatable Avatar Reconstruction from Pose-free Articulated Human Images2025-06-16SMPL Normal Map Is All You Need for Single-view Textured Human Reconstruction2025-06-15MetricHMR: Metric Human Mesh Recovery from Monocular Images2025-06-11Learning Pyramid-structured Long-range Dependencies for 3D Human Pose Estimation2025-06-03HumanRAM: Feed-forward Human Reconstruction and Animation Model using Transformers2025-06-03