TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/HDNet: Human Depth Estimation for Multi-Person Camera-Spac...

HDNet: Human Depth Estimation for Multi-Person Camera-Space Localization

Jiahao Lin, Gim Hee Lee

2020-07-17ECCV 2020 8Root Joint LocalizationPose EstimationDepth Estimation3D Multi-Person Pose Estimation (root-relative)3D Multi-Person Pose Estimation (absolute)3D Pose Estimation
PaperPDFCode(official)

Abstract

Current works on multi-person 3D pose estimation mainly focus on the estimation of the 3D joint locations relative to the root joint and ignore the absolute locations of each pose. In this paper, we propose the Human Depth Estimation Network (HDNet), an end-to-end framework for absolute root joint localization in the camera coordinate space. Our HDNet first estimates the 2D human pose with heatmaps of the joints. These estimated heatmaps serve as attention masks for pooling features from image regions corresponding to the target person. A skeleton-based Graph Neural Network (GNN) is utilized to propagate features among joints. We formulate the target depth regression as a bin index estimation problem, which can be transformed with a soft-argmax operation from the classification output of our HDNet. We evaluate our HDNet on the root joint localization and root-relative 3D pose estimation tasks with two benchmark datasets, i.e., Human3.6M and MuPoTS-3D. The experimental results show that we outperform the previous state-of-the-art consistently under multiple evaluation metrics. Our source code is available at: https://github.com/jiahaoLjh/HumanDepth.

Results

TaskDatasetMetricValueModel
3D Multi-Person Pose Estimation (root-relative)MuPoTS-3D3DPCK83.7HDNet
3D Human Pose EstimationMuPoTS-3D3DPCK35.2HDNet
3D Human Pose EstimationMuPoTS-3D3DPCK83.7HDNet
3D Multi-Person Pose Estimation (absolute)MuPoTS-3D3DPCK35.2HDNet
Pose EstimationMuPoTS-3D3DPCK35.2HDNet
Pose EstimationMuPoTS-3D3DPCK83.7HDNet
3DMuPoTS-3D3DPCK35.2HDNet
3DMuPoTS-3D3DPCK83.7HDNet
3D Multi-Person Pose EstimationMuPoTS-3D3DPCK35.2HDNet
3D Multi-Person Pose EstimationMuPoTS-3D3DPCK83.7HDNet
1 Image, 2*2 StitchiMuPoTS-3D3DPCK35.2HDNet
1 Image, 2*2 StitchiMuPoTS-3D3DPCK83.7HDNet

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16