TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/H3WB: Human3.6M 3D WholeBody Dataset and Benchmark

H3WB: Human3.6M 3D WholeBody Dataset and Benchmark

Yue Zhu, Nermin Samet, David Picard

2022-11-28ICCV 2023 13D Human Pose Estimation3D Hand Pose EstimationPose Estimation3D Facial Landmark Localization
PaperPDFCode(official)

Abstract

We present a benchmark for 3D human whole-body pose estimation, which involves identifying accurate 3D keypoints on the entire human body, including face, hands, body, and feet. Currently, the lack of a fully annotated and accurate 3D whole-body dataset results in deep networks being trained separately on specific body parts, which are combined during inference. Or they rely on pseudo-groundtruth provided by parametric body models which are not as accurate as detection based methods. To overcome these issues, we introduce the Human3.6M 3D WholeBody (H3WB) dataset, which provides whole-body annotations for the Human3.6M dataset using the COCO Wholebody layout. H3WB comprises 133 whole-body keypoint annotations on 100K images, made possible by our new multi-view pipeline. We also propose three tasks: i) 3D whole-body pose lifting from 2D complete whole-body pose, ii) 3D whole-body pose lifting from 2D incomplete whole-body pose, and iii) 3D whole-body pose estimation from a single RGB image. Additionally, we report several baselines from popular methods for these tasks. Furthermore, we also provide automated 3D whole-body annotations of TotalCapture and experimentally show that when used with H3WB it helps to improve the performance. Code and dataset is available at https://github.com/wholebody3d/wholebody3d

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingH3WBAverage MPJPE (mm)14.6Large SimpleBaseline
Facial Recognition and ModellingH3WBAverage MPJPE (mm)17.8Jointformer
Facial Recognition and ModellingH3WBAverage MPJPE (mm)17.9CanonPose + 3D supervision
Facial Recognition and ModellingH3WBAverage MPJPE (mm)19.8Large SimpleBaseline
Facial Recognition and ModellingH3WBAverage MPJPE (mm)19.8Jointformer
Facial Recognition and ModellingH3WBAverage MPJPE (mm)20.7CPN + Jointformer
Facial Recognition and ModellingH3WBAverage MPJPE (mm)22.2CanonPose + 3D supervision
Facial Recognition and ModellingH3WBAverage MPJPE (mm)24.6CanonPose
Facial Recognition and ModellingH3WBAverage MPJPE (mm)24.6SimpleBaseline
Facial Recognition and ModellingH3WBAverage MPJPE (mm)26.3Resnet50
Facial Recognition and ModellingH3WBAverage MPJPE (mm)31.9CanonPose
Facial Recognition and ModellingH3WBAverage MPJPE (mm)32.5SHN + SimpleBaseline
Facial Recognition and ModellingH3WBAverage MPJPE (mm)34SimpleBaseline
3D Human Pose EstimationH3WBMPJPE84.9Jointformer
3D Human Pose EstimationH3WBMPJPE103Jointformer
3D Human Pose EstimationH3WBMPJPE112.6Large SimpleBaseline
3D Human Pose EstimationH3WBMPJPE117.5CanonPose + 3D supervision
3D Human Pose EstimationH3WBMPJPE125.7SimpleBaseline
3D Human Pose EstimationH3WBMPJPE131.6Large SimpleBaseline
3D Human Pose EstimationH3WBMPJPE142.8CPN + Jointformer
3D Human Pose EstimationH3WBMPJPE151.6Resnet50
3D Human Pose EstimationH3WBMPJPE155.9CanonPose + 3D supervision
3D Human Pose EstimationH3WBMPJPE189.6SHN + SimpleBaseline
3D Human Pose EstimationH3WBMPJPE193.7CanonPose
3D Human Pose EstimationH3WBMPJPE252SimpleBaseline
3D Human Pose EstimationH3WBMPJPE264.4CanonPose
HandH3WBAverage MPJPE (mm)31.7Large SimpleBaseline
HandH3WBAverage MPJPE (mm)38.3CanonPose + 3D supervision
HandH3WBAverage MPJPE (mm)42.5SimpleBaseline
HandH3WBAverage MPJPE (mm)43.7Jointformer
HandH3WBAverage MPJPE (mm)44.8Large SimpleBaseline
HandH3WBAverage MPJPE (mm)47.4CanonPose + 3D supervision
HandH3WBAverage MPJPE (mm)48.9CanonPose
HandH3WBAverage MPJPE (mm)53.5Jointformer
HandH3WBAverage MPJPE (mm)56.2CanonPose
HandH3WBAverage MPJPE (mm)56.9CPN + Jointformer
HandH3WBAverage MPJPE (mm)63.1Resnet50
HandH3WBAverage MPJPE (mm)64.3SHN + SimpleBaseline
HandH3WBAverage MPJPE (mm)83.4SimpleBaseline
Pose EstimationH3WBMPJPE84.9Jointformer
Pose EstimationH3WBMPJPE103Jointformer
Pose EstimationH3WBMPJPE112.6Large SimpleBaseline
Pose EstimationH3WBMPJPE117.5CanonPose + 3D supervision
Pose EstimationH3WBMPJPE125.7SimpleBaseline
Pose EstimationH3WBMPJPE131.6Large SimpleBaseline
Pose EstimationH3WBMPJPE142.8CPN + Jointformer
Pose EstimationH3WBMPJPE151.6Resnet50
Pose EstimationH3WBMPJPE155.9CanonPose + 3D supervision
Pose EstimationH3WBMPJPE189.6SHN + SimpleBaseline
Pose EstimationH3WBMPJPE193.7CanonPose
Pose EstimationH3WBMPJPE252SimpleBaseline
Pose EstimationH3WBMPJPE264.4CanonPose
Pose EstimationH3WBAverage MPJPE (mm)31.7Large SimpleBaseline
Pose EstimationH3WBAverage MPJPE (mm)38.3CanonPose + 3D supervision
Pose EstimationH3WBAverage MPJPE (mm)42.5SimpleBaseline
Pose EstimationH3WBAverage MPJPE (mm)43.7Jointformer
Pose EstimationH3WBAverage MPJPE (mm)44.8Large SimpleBaseline
Pose EstimationH3WBAverage MPJPE (mm)47.4CanonPose + 3D supervision
Pose EstimationH3WBAverage MPJPE (mm)48.9CanonPose
Pose EstimationH3WBAverage MPJPE (mm)53.5Jointformer
Pose EstimationH3WBAverage MPJPE (mm)56.2CanonPose
Pose EstimationH3WBAverage MPJPE (mm)56.9CPN + Jointformer
Pose EstimationH3WBAverage MPJPE (mm)63.1Resnet50
Pose EstimationH3WBAverage MPJPE (mm)64.3SHN + SimpleBaseline
Pose EstimationH3WBAverage MPJPE (mm)83.4SimpleBaseline
Hand Pose EstimationH3WBAverage MPJPE (mm)31.7Large SimpleBaseline
Hand Pose EstimationH3WBAverage MPJPE (mm)38.3CanonPose + 3D supervision
Hand Pose EstimationH3WBAverage MPJPE (mm)42.5SimpleBaseline
Hand Pose EstimationH3WBAverage MPJPE (mm)43.7Jointformer
Hand Pose EstimationH3WBAverage MPJPE (mm)44.8Large SimpleBaseline
Hand Pose EstimationH3WBAverage MPJPE (mm)47.4CanonPose + 3D supervision
Hand Pose EstimationH3WBAverage MPJPE (mm)48.9CanonPose
Hand Pose EstimationH3WBAverage MPJPE (mm)53.5Jointformer
Hand Pose EstimationH3WBAverage MPJPE (mm)56.2CanonPose
Hand Pose EstimationH3WBAverage MPJPE (mm)56.9CPN + Jointformer
Hand Pose EstimationH3WBAverage MPJPE (mm)63.1Resnet50
Hand Pose EstimationH3WBAverage MPJPE (mm)64.3SHN + SimpleBaseline
Hand Pose EstimationH3WBAverage MPJPE (mm)83.4SimpleBaseline
Facial Landmark DetectionH3WBAverage MPJPE (mm)14.6Large SimpleBaseline
Facial Landmark DetectionH3WBAverage MPJPE (mm)17.8Jointformer
Facial Landmark DetectionH3WBAverage MPJPE (mm)17.9CanonPose + 3D supervision
Facial Landmark DetectionH3WBAverage MPJPE (mm)19.8Large SimpleBaseline
Facial Landmark DetectionH3WBAverage MPJPE (mm)19.8Jointformer
Facial Landmark DetectionH3WBAverage MPJPE (mm)20.7CPN + Jointformer
Facial Landmark DetectionH3WBAverage MPJPE (mm)22.2CanonPose + 3D supervision
Facial Landmark DetectionH3WBAverage MPJPE (mm)24.6CanonPose
Facial Landmark DetectionH3WBAverage MPJPE (mm)24.6SimpleBaseline
Facial Landmark DetectionH3WBAverage MPJPE (mm)26.3Resnet50
Facial Landmark DetectionH3WBAverage MPJPE (mm)31.9CanonPose
Facial Landmark DetectionH3WBAverage MPJPE (mm)32.5SHN + SimpleBaseline
Facial Landmark DetectionH3WBAverage MPJPE (mm)34SimpleBaseline
Face ReconstructionH3WBAverage MPJPE (mm)14.6Large SimpleBaseline
Face ReconstructionH3WBAverage MPJPE (mm)17.8Jointformer
Face ReconstructionH3WBAverage MPJPE (mm)17.9CanonPose + 3D supervision
Face ReconstructionH3WBAverage MPJPE (mm)19.8Large SimpleBaseline
Face ReconstructionH3WBAverage MPJPE (mm)19.8Jointformer
Face ReconstructionH3WBAverage MPJPE (mm)20.7CPN + Jointformer
Face ReconstructionH3WBAverage MPJPE (mm)22.2CanonPose + 3D supervision
Face ReconstructionH3WBAverage MPJPE (mm)24.6CanonPose
Face ReconstructionH3WBAverage MPJPE (mm)24.6SimpleBaseline
Face ReconstructionH3WBAverage MPJPE (mm)26.3Resnet50
Face ReconstructionH3WBAverage MPJPE (mm)31.9CanonPose
Face ReconstructionH3WBAverage MPJPE (mm)32.5SHN + SimpleBaseline
Face ReconstructionH3WBAverage MPJPE (mm)34SimpleBaseline
3DH3WBMPJPE84.9Jointformer
3DH3WBMPJPE103Jointformer
3DH3WBMPJPE112.6Large SimpleBaseline
3DH3WBMPJPE117.5CanonPose + 3D supervision
3DH3WBMPJPE125.7SimpleBaseline
3DH3WBMPJPE131.6Large SimpleBaseline
3DH3WBMPJPE142.8CPN + Jointformer
3DH3WBMPJPE151.6Resnet50
3DH3WBMPJPE155.9CanonPose + 3D supervision
3DH3WBMPJPE189.6SHN + SimpleBaseline
3DH3WBMPJPE193.7CanonPose
3DH3WBMPJPE252SimpleBaseline
3DH3WBMPJPE264.4CanonPose
3DH3WBAverage MPJPE (mm)31.7Large SimpleBaseline
3DH3WBAverage MPJPE (mm)38.3CanonPose + 3D supervision
3DH3WBAverage MPJPE (mm)42.5SimpleBaseline
3DH3WBAverage MPJPE (mm)43.7Jointformer
3DH3WBAverage MPJPE (mm)44.8Large SimpleBaseline
3DH3WBAverage MPJPE (mm)47.4CanonPose + 3D supervision
3DH3WBAverage MPJPE (mm)48.9CanonPose
3DH3WBAverage MPJPE (mm)53.5Jointformer
3DH3WBAverage MPJPE (mm)56.2CanonPose
3DH3WBAverage MPJPE (mm)56.9CPN + Jointformer
3DH3WBAverage MPJPE (mm)63.1Resnet50
3DH3WBAverage MPJPE (mm)64.3SHN + SimpleBaseline
3DH3WBAverage MPJPE (mm)83.4SimpleBaseline
3DH3WBAverage MPJPE (mm)14.6Large SimpleBaseline
3DH3WBAverage MPJPE (mm)17.8Jointformer
3DH3WBAverage MPJPE (mm)17.9CanonPose + 3D supervision
3DH3WBAverage MPJPE (mm)19.8Large SimpleBaseline
3DH3WBAverage MPJPE (mm)19.8Jointformer
3DH3WBAverage MPJPE (mm)20.7CPN + Jointformer
3DH3WBAverage MPJPE (mm)22.2CanonPose + 3D supervision
3DH3WBAverage MPJPE (mm)24.6CanonPose
3DH3WBAverage MPJPE (mm)24.6SimpleBaseline
3DH3WBAverage MPJPE (mm)26.3Resnet50
3DH3WBAverage MPJPE (mm)31.9CanonPose
3DH3WBAverage MPJPE (mm)32.5SHN + SimpleBaseline
3DH3WBAverage MPJPE (mm)34SimpleBaseline
3D Face ModellingH3WBAverage MPJPE (mm)14.6Large SimpleBaseline
3D Face ModellingH3WBAverage MPJPE (mm)17.8Jointformer
3D Face ModellingH3WBAverage MPJPE (mm)17.9CanonPose + 3D supervision
3D Face ModellingH3WBAverage MPJPE (mm)19.8Large SimpleBaseline
3D Face ModellingH3WBAverage MPJPE (mm)19.8Jointformer
3D Face ModellingH3WBAverage MPJPE (mm)20.7CPN + Jointformer
3D Face ModellingH3WBAverage MPJPE (mm)22.2CanonPose + 3D supervision
3D Face ModellingH3WBAverage MPJPE (mm)24.6CanonPose
3D Face ModellingH3WBAverage MPJPE (mm)24.6SimpleBaseline
3D Face ModellingH3WBAverage MPJPE (mm)26.3Resnet50
3D Face ModellingH3WBAverage MPJPE (mm)31.9CanonPose
3D Face ModellingH3WBAverage MPJPE (mm)32.5SHN + SimpleBaseline
3D Face ModellingH3WBAverage MPJPE (mm)34SimpleBaseline
3D Face ReconstructionH3WBAverage MPJPE (mm)14.6Large SimpleBaseline
3D Face ReconstructionH3WBAverage MPJPE (mm)17.8Jointformer
3D Face ReconstructionH3WBAverage MPJPE (mm)17.9CanonPose + 3D supervision
3D Face ReconstructionH3WBAverage MPJPE (mm)19.8Large SimpleBaseline
3D Face ReconstructionH3WBAverage MPJPE (mm)19.8Jointformer
3D Face ReconstructionH3WBAverage MPJPE (mm)20.7CPN + Jointformer
3D Face ReconstructionH3WBAverage MPJPE (mm)22.2CanonPose + 3D supervision
3D Face ReconstructionH3WBAverage MPJPE (mm)24.6CanonPose
3D Face ReconstructionH3WBAverage MPJPE (mm)24.6SimpleBaseline
3D Face ReconstructionH3WBAverage MPJPE (mm)26.3Resnet50
3D Face ReconstructionH3WBAverage MPJPE (mm)31.9CanonPose
3D Face ReconstructionH3WBAverage MPJPE (mm)32.5SHN + SimpleBaseline
3D Face ReconstructionH3WBAverage MPJPE (mm)34SimpleBaseline
3D Hand Pose EstimationH3WBAverage MPJPE (mm)31.7Large SimpleBaseline
3D Hand Pose EstimationH3WBAverage MPJPE (mm)38.3CanonPose + 3D supervision
3D Hand Pose EstimationH3WBAverage MPJPE (mm)42.5SimpleBaseline
3D Hand Pose EstimationH3WBAverage MPJPE (mm)43.7Jointformer
3D Hand Pose EstimationH3WBAverage MPJPE (mm)44.8Large SimpleBaseline
3D Hand Pose EstimationH3WBAverage MPJPE (mm)47.4CanonPose + 3D supervision
3D Hand Pose EstimationH3WBAverage MPJPE (mm)48.9CanonPose
3D Hand Pose EstimationH3WBAverage MPJPE (mm)53.5Jointformer
3D Hand Pose EstimationH3WBAverage MPJPE (mm)56.2CanonPose
3D Hand Pose EstimationH3WBAverage MPJPE (mm)56.9CPN + Jointformer
3D Hand Pose EstimationH3WBAverage MPJPE (mm)63.1Resnet50
3D Hand Pose EstimationH3WBAverage MPJPE (mm)64.3SHN + SimpleBaseline
3D Hand Pose EstimationH3WBAverage MPJPE (mm)83.4SimpleBaseline
1 Image, 2*2 StitchiH3WBMPJPE84.9Jointformer
1 Image, 2*2 StitchiH3WBMPJPE103Jointformer
1 Image, 2*2 StitchiH3WBMPJPE112.6Large SimpleBaseline
1 Image, 2*2 StitchiH3WBMPJPE117.5CanonPose + 3D supervision
1 Image, 2*2 StitchiH3WBMPJPE125.7SimpleBaseline
1 Image, 2*2 StitchiH3WBMPJPE131.6Large SimpleBaseline
1 Image, 2*2 StitchiH3WBMPJPE142.8CPN + Jointformer
1 Image, 2*2 StitchiH3WBMPJPE151.6Resnet50
1 Image, 2*2 StitchiH3WBMPJPE155.9CanonPose + 3D supervision
1 Image, 2*2 StitchiH3WBMPJPE189.6SHN + SimpleBaseline
1 Image, 2*2 StitchiH3WBMPJPE193.7CanonPose
1 Image, 2*2 StitchiH3WBMPJPE252SimpleBaseline
1 Image, 2*2 StitchiH3WBMPJPE264.4CanonPose
1 Image, 2*2 StitchiH3WBAverage MPJPE (mm)31.7Large SimpleBaseline
1 Image, 2*2 StitchiH3WBAverage MPJPE (mm)38.3CanonPose + 3D supervision
1 Image, 2*2 StitchiH3WBAverage MPJPE (mm)42.5SimpleBaseline
1 Image, 2*2 StitchiH3WBAverage MPJPE (mm)43.7Jointformer
1 Image, 2*2 StitchiH3WBAverage MPJPE (mm)44.8Large SimpleBaseline
1 Image, 2*2 StitchiH3WBAverage MPJPE (mm)47.4CanonPose + 3D supervision
1 Image, 2*2 StitchiH3WBAverage MPJPE (mm)48.9CanonPose
1 Image, 2*2 StitchiH3WBAverage MPJPE (mm)53.5Jointformer
1 Image, 2*2 StitchiH3WBAverage MPJPE (mm)56.2CanonPose
1 Image, 2*2 StitchiH3WBAverage MPJPE (mm)56.9CPN + Jointformer
1 Image, 2*2 StitchiH3WBAverage MPJPE (mm)63.1Resnet50
1 Image, 2*2 StitchiH3WBAverage MPJPE (mm)64.3SHN + SimpleBaseline
1 Image, 2*2 StitchiH3WBAverage MPJPE (mm)83.4SimpleBaseline

Related Papers

$Ï€^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16