TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/HPRNet: Hierarchical Point Regression for Whole-Body Human...

HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation

Nermin Samet, Emre Akbas

2021-06-08regression2D Human Pose EstimationFacial Landmark DetectionPose EstimationMulti-Person Pose EstimationKeypoint DetectionFoot keypoint detectionHand Pose EstimationFace Detection
PaperPDFCode(official)

Abstract

In this paper, we present a new bottom-up one-stage method for whole-body pose estimation, which we call "hierarchical point regression," or HPRNet for short. In standard body pose estimation, the locations of $\sim 17$ major joints on the human body are estimated. Differently, in whole-body pose estimation, the locations of fine-grained keypoints (68 on face, 21 on each hand and 3 on each foot) are estimated as well, which creates a scale variance problem that needs to be addressed. To handle the scale variance among different body parts, we build a hierarchical point representation of body parts and jointly regress them. The relative locations of fine-grained keypoints in each part (e.g. face) are regressed in reference to the center of that part, whose location itself is estimated relative to the person center. In addition, unlike the existing two-stage methods, our method predicts whole-body pose in a constant time independent of the number of people in an image. On the COCO WholeBody dataset, HPRNet significantly outperforms all previous bottom-up methods on the keypoint detection of all whole-body parts (i.e. body, foot, face and hand); it also achieves state-of-the-art results on face (75.4 AP) and hand (50.4 AP) keypoint detection. Code and models are available at \url{https://github.com/nerminsamet/HPRNet}.

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingCOCO-WholeBodyAP56.4HPRNet (Hourglass-104)
Facial Recognition and ModellingCOCO-WholeBodyAP5082.4HPRNet (Hourglass-104)
Facial Recognition and ModellingCOCO-WholeBodyAP7567.1HPRNet (Hourglass-104)
Facial Recognition and ModellingCOCO-WholeBodyAPL63.3HPRNet (Hourglass-104)
Facial Recognition and ModellingCOCO-WholeBodyAPM43.4HPRNet (Hourglass-104)
Facial Recognition and ModellingCOCO-WholeBodyAP55.8HPRNet (DLA)
Facial Recognition and ModellingCOCO-WholeBodyAP5082.3HPRNet (DLA)
Facial Recognition and ModellingCOCO-WholeBodyAP7566.2HPRNet (DLA)
Facial Recognition and ModellingCOCO-WholeBodyAPL63.6HPRNet (DLA)
Facial Recognition and ModellingCOCO-WholeBodyAPM40HPRNet (DLA)
Facial Recognition and ModellingCOCO-WholeBodykeypoint AP75.4HPRNet (Hourglass-104)
Facial Recognition and ModellingCOCO-WholeBodykeypoint AP74.6HPRNet (DLA)
HandCOCO-WholeBodykeypoint AP50.4HPRNet (Hourglass-104)
HandCOCO-WholeBodykeypoint AP47HPRNet (DLA)
Pose EstimationCOCO-WholeBodykeypoint AP50.4HPRNet (Hourglass-104)
Pose EstimationCOCO-WholeBodykeypoint AP47HPRNet (DLA)
Pose EstimationCOCO-WholeBodykeypoint AP59.4HPRNet (Hourglass-104)
Pose EstimationCOCO-WholeBodykeypoint AP55.2HPRNet (DLA)
Hand Pose EstimationCOCO-WholeBodykeypoint AP50.4HPRNet (Hourglass-104)
Hand Pose EstimationCOCO-WholeBodykeypoint AP47HPRNet (DLA)
Facial Landmark DetectionCOCO-WholeBodykeypoint AP75.4HPRNet (Hourglass-104)
Facial Landmark DetectionCOCO-WholeBodykeypoint AP74.6HPRNet (DLA)
Face DetectionCOCO-WholeBodyAP56.4HPRNet (Hourglass-104)
Face DetectionCOCO-WholeBodyAP5082.4HPRNet (Hourglass-104)
Face DetectionCOCO-WholeBodyAP7567.1HPRNet (Hourglass-104)
Face DetectionCOCO-WholeBodyAPL63.3HPRNet (Hourglass-104)
Face DetectionCOCO-WholeBodyAPM43.4HPRNet (Hourglass-104)
Face DetectionCOCO-WholeBodyAP55.8HPRNet (DLA)
Face DetectionCOCO-WholeBodyAP5082.3HPRNet (DLA)
Face DetectionCOCO-WholeBodyAP7566.2HPRNet (DLA)
Face DetectionCOCO-WholeBodyAPL63.6HPRNet (DLA)
Face DetectionCOCO-WholeBodyAPM40HPRNet (DLA)
Face ReconstructionCOCO-WholeBodyAP56.4HPRNet (Hourglass-104)
Face ReconstructionCOCO-WholeBodyAP5082.4HPRNet (Hourglass-104)
Face ReconstructionCOCO-WholeBodyAP7567.1HPRNet (Hourglass-104)
Face ReconstructionCOCO-WholeBodyAPL63.3HPRNet (Hourglass-104)
Face ReconstructionCOCO-WholeBodyAPM43.4HPRNet (Hourglass-104)
Face ReconstructionCOCO-WholeBodyAP55.8HPRNet (DLA)
Face ReconstructionCOCO-WholeBodyAP5082.3HPRNet (DLA)
Face ReconstructionCOCO-WholeBodyAP7566.2HPRNet (DLA)
Face ReconstructionCOCO-WholeBodyAPL63.6HPRNet (DLA)
Face ReconstructionCOCO-WholeBodyAPM40HPRNet (DLA)
Face ReconstructionCOCO-WholeBodykeypoint AP75.4HPRNet (Hourglass-104)
Face ReconstructionCOCO-WholeBodykeypoint AP74.6HPRNet (DLA)
3DCOCO-WholeBodykeypoint AP50.4HPRNet (Hourglass-104)
3DCOCO-WholeBodykeypoint AP47HPRNet (DLA)
3DCOCO-WholeBodykeypoint AP59.4HPRNet (Hourglass-104)
3DCOCO-WholeBodykeypoint AP55.2HPRNet (DLA)
3DCOCO-WholeBodyAP56.4HPRNet (Hourglass-104)
3DCOCO-WholeBodyAP5082.4HPRNet (Hourglass-104)
3DCOCO-WholeBodyAP7567.1HPRNet (Hourglass-104)
3DCOCO-WholeBodyAPL63.3HPRNet (Hourglass-104)
3DCOCO-WholeBodyAPM43.4HPRNet (Hourglass-104)
3DCOCO-WholeBodyAP55.8HPRNet (DLA)
3DCOCO-WholeBodyAP5082.3HPRNet (DLA)
3DCOCO-WholeBodyAP7566.2HPRNet (DLA)
3DCOCO-WholeBodyAPL63.6HPRNet (DLA)
3DCOCO-WholeBodyAPM40HPRNet (DLA)
3DCOCO-WholeBodykeypoint AP75.4HPRNet (Hourglass-104)
3DCOCO-WholeBodykeypoint AP74.6HPRNet (DLA)
3D Face ModellingCOCO-WholeBodyAP56.4HPRNet (Hourglass-104)
3D Face ModellingCOCO-WholeBodyAP5082.4HPRNet (Hourglass-104)
3D Face ModellingCOCO-WholeBodyAP7567.1HPRNet (Hourglass-104)
3D Face ModellingCOCO-WholeBodyAPL63.3HPRNet (Hourglass-104)
3D Face ModellingCOCO-WholeBodyAPM43.4HPRNet (Hourglass-104)
3D Face ModellingCOCO-WholeBodyAP55.8HPRNet (DLA)
3D Face ModellingCOCO-WholeBodyAP5082.3HPRNet (DLA)
3D Face ModellingCOCO-WholeBodyAP7566.2HPRNet (DLA)
3D Face ModellingCOCO-WholeBodyAPL63.6HPRNet (DLA)
3D Face ModellingCOCO-WholeBodyAPM40HPRNet (DLA)
3D Face ModellingCOCO-WholeBodykeypoint AP75.4HPRNet (Hourglass-104)
3D Face ModellingCOCO-WholeBodykeypoint AP74.6HPRNet (DLA)
3D Face ReconstructionCOCO-WholeBodyAP56.4HPRNet (Hourglass-104)
3D Face ReconstructionCOCO-WholeBodyAP5082.4HPRNet (Hourglass-104)
3D Face ReconstructionCOCO-WholeBodyAP7567.1HPRNet (Hourglass-104)
3D Face ReconstructionCOCO-WholeBodyAPL63.3HPRNet (Hourglass-104)
3D Face ReconstructionCOCO-WholeBodyAPM43.4HPRNet (Hourglass-104)
3D Face ReconstructionCOCO-WholeBodyAP55.8HPRNet (DLA)
3D Face ReconstructionCOCO-WholeBodyAP5082.3HPRNet (DLA)
3D Face ReconstructionCOCO-WholeBodyAP7566.2HPRNet (DLA)
3D Face ReconstructionCOCO-WholeBodyAPL63.6HPRNet (DLA)
3D Face ReconstructionCOCO-WholeBodyAPM40HPRNet (DLA)
3D Face ReconstructionCOCO-WholeBodykeypoint AP75.4HPRNet (Hourglass-104)
3D Face ReconstructionCOCO-WholeBodykeypoint AP74.6HPRNet (DLA)
2D Human Pose EstimationCOCO-WholeBodyWB34.8HPRNet
2D Human Pose EstimationCOCO-WholeBodybody59.4HPRNet
2D Human Pose EstimationCOCO-WholeBodyface75.4HPRNet
2D Human Pose EstimationCOCO-WholeBodyfoot53HPRNet
2D Human Pose EstimationCOCO-WholeBodyhand50.4HPRNet
Multi-Person Pose EstimationCOCO-WholeBodykeypoint AP59.4HPRNet (Hourglass-104)
Multi-Person Pose EstimationCOCO-WholeBodykeypoint AP55.2HPRNet (DLA)
1 Image, 2*2 StitchiCOCO-WholeBodykeypoint AP50.4HPRNet (Hourglass-104)
1 Image, 2*2 StitchiCOCO-WholeBodykeypoint AP47HPRNet (DLA)
1 Image, 2*2 StitchiCOCO-WholeBodykeypoint AP59.4HPRNet (Hourglass-104)
1 Image, 2*2 StitchiCOCO-WholeBodykeypoint AP55.2HPRNet (DLA)

Related Papers

Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression2025-07-20$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17Neural Network-Guided Symbolic Regression for Interpretable Descriptor Discovery in Perovskite Catalysts2025-07-16Imbalanced Regression Pipeline Recommendation2025-07-16