TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Monocular Total Capture: Posing Face, Body, and Hands in t...

Monocular Total Capture: Posing Face, Body, and Hands in the Wild

Donglai Xiang, Hanbyul Joo, Yaser Sheikh

2018-12-04CVPR 2019 63D Human Pose EstimationMonocular 3D Human Pose EstimationHand Pose Estimation
PaperPDFCode

Abstract

We present the first method to capture the 3D total motion of a target person from a monocular view input. Given an image or a monocular video, our method reconstructs the motion from body, face, and fingers represented by a 3D deformable mesh model. We use an efficient representation called 3D Part Orientation Fields (POFs), to encode the 3D orientations of all body parts in the common 2D image space. POFs are predicted by a Fully Convolutional Network (FCN), along with the joint confidence maps. To train our network, we collect a new 3D human motion dataset capturing diverse total body motion of 40 subjects in a multiview system. We leverage a 3D deformable human model to reconstruct total body pose from the CNN outputs by exploiting the pose and shape prior in the model. We also present a texture-based tracking method to obtain temporally coherent motion capture output. We perform thorough quantitative evaluations including comparison with the existing body-specific and hand-specific methods, and performance analysis on camera viewpoint and human pose changes. Finally, we demonstrate the results of our total body motion capture on various challenging in-the-wild videos. Our code and newly collected human motion dataset will be publicly shared.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)58.3Monocular Total Capture
3D Human Pose EstimationHuman3.6MFrames Needed1Monocular Total Capture
Pose EstimationHuman3.6MAverage MPJPE (mm)58.3Monocular Total Capture
Pose EstimationHuman3.6MFrames Needed1Monocular Total Capture
3DHuman3.6MAverage MPJPE (mm)58.3Monocular Total Capture
3DHuman3.6MFrames Needed1Monocular Total Capture
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)58.3Monocular Total Capture
1 Image, 2*2 StitchiHuman3.6MFrames Needed1Monocular Total Capture

Related Papers

Systematic Comparison of Projection Methods for Monocular 3D Human Pose Estimation on Fisheye Images2025-06-24ExtPose: Robust and Coherent Pose Estimation by Extending ViTs2025-06-18PoseGRAF: Geometric-Reinforced Adaptive Fusion for Monocular 3D Human Pose Estimation2025-06-17Monocular 3D Hand Pose Estimation with Implicit Camera Alignment2025-06-10Learning Pyramid-structured Long-range Dependencies for 3D Human Pose Estimation2025-06-03EventEgoHands: Event-based Egocentric 3D Hand Mesh Reconstruction2025-05-25Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery2025-05-23UPTor: Unified 3D Human Pose Dynamics and Trajectory Prediction for Human-Robot Interaction2025-05-20