TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Distribution-Aware Coordinate Representation for Human Pos...

Distribution-Aware Coordinate Representation for Human Pose Estimation

Feng Zhang, Xiatian Zhu, Hanbin Dai, Mao Ye, Ce Zhu

2019-10-14CVPR 2020 6Pose EstimationMulti-Person Pose EstimationKeypoint Detection
PaperPDFCodeCodeCodeCodeCodeCode(official)

Abstract

While being the de facto standard coordinate representation in human pose estimation, heatmap is never systematically investigated in the literature, to our best knowledge. This work fills this gap by studying the coordinate representation with a particular focus on the heatmap. Interestingly, we found that the process of decoding the predicted heatmaps into the final joint coordinates in the original image space is surprisingly significant for human pose estimation performance, which nevertheless was not recognised before. In light of the discovered importance, we further probe the design limitations of the standard coordinate decoding method widely used by existing methods, and propose a more principled distribution-aware decoding method. Meanwhile, we improve the standard coordinate encoding process (i.e. transforming ground-truth coordinates to heatmaps) by generating accurate heatmap distributions for unbiased model training. Taking the two together, we formulate a novel Distribution-Aware coordinate Representation of Keypoint (DARK) method. Serving as a model-agnostic plug-in, DARK significantly improves the performance of a variety of state-of-the-art human pose estimation models. Extensive experiments show that DARK yields the best results on two common benchmarks, MPII and COCO, consistently validating the usefulness and effectiveness of our novel coordinate representation idea.

Results

TaskDatasetMetricValueModel
Pose EstimationCOCO test-devAP77.4HRNet-W48+DARK
Pose EstimationCOCO test-devAP5092.6HRNet-W48+DARK
Pose EstimationCOCO test-devAP7584.6HRNet-W48+DARK
Pose EstimationCOCO test-devAPL83.7HRNet-W48+DARK
Pose EstimationCOCO test-devAPM73.6HRNet-W48+DARK
Pose EstimationCOCO test-devAR82.3HRNet-W48+DARK
Pose EstimationMPII Human PosePCKh-0.590.6DarkPose
Pose EstimationCOCO (Common Objects in Context)Test AP76.2DarkPose(384x288)
Pose EstimationCOCO (Common Objects in Context)AP0.774DarkPose
3DCOCO test-devAP77.4HRNet-W48+DARK
3DCOCO test-devAP5092.6HRNet-W48+DARK
3DCOCO test-devAP7584.6HRNet-W48+DARK
3DCOCO test-devAPL83.7HRNet-W48+DARK
3DCOCO test-devAPM73.6HRNet-W48+DARK
3DCOCO test-devAR82.3HRNet-W48+DARK
3DMPII Human PosePCKh-0.590.6DarkPose
3DCOCO (Common Objects in Context)Test AP76.2DarkPose(384x288)
3DCOCO (Common Objects in Context)AP0.774DarkPose
Multi-Person Pose EstimationCOCO (Common Objects in Context)AP0.774DarkPose
1 Image, 2*2 StitchiCOCO test-devAP77.4HRNet-W48+DARK
1 Image, 2*2 StitchiCOCO test-devAP5092.6HRNet-W48+DARK
1 Image, 2*2 StitchiCOCO test-devAP7584.6HRNet-W48+DARK
1 Image, 2*2 StitchiCOCO test-devAPL83.7HRNet-W48+DARK
1 Image, 2*2 StitchiCOCO test-devAPM73.6HRNet-W48+DARK
1 Image, 2*2 StitchiCOCO test-devAR82.3HRNet-W48+DARK
1 Image, 2*2 StitchiMPII Human PosePCKh-0.590.6DarkPose
1 Image, 2*2 StitchiCOCO (Common Objects in Context)Test AP76.2DarkPose(384x288)
1 Image, 2*2 StitchiCOCO (Common Objects in Context)AP0.774DarkPose

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16