TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Towards High Performance Human Keypoint Detection

Towards High Performance Human Keypoint Detection

Jing Zhang, Zhe Chen, DaCheng Tao

2020-02-03Human DetectionVocal Bursts Intensity PredictionPose EstimationKeypoint Detection
PaperPDFCode(official)

Abstract

Human keypoint detection from a single image is very challenging due to occlusion, blur, illumination and scale variance. In this paper, we address this problem from three aspects by devising an efficient network structure, proposing three effective training strategies, and exploiting four useful postprocessing techniques. First, we find that context information plays an important role in reasoning human body configuration and invisible keypoints. Inspired by this, we propose a cascaded context mixer (CCM), which efficiently integrates spatial and channel context information and progressively refines them. Then, to maximize CCM's representation capability, we develop a hard-negative person detection mining strategy and a joint-training strategy by exploiting abundant unlabeled data. It enables CCM to learn discriminative features from massive diverse poses. Third, we present several sub-pixel refinement techniques for postprocessing keypoint predictions to improve detection accuracy. Extensive experiments on the MS COCO keypoint detection benchmark demonstrate the superiority of the proposed method over representative state-of-the-art (SOTA) methods. Our single model achieves comparable performance with the winner of the 2018 COCO Keypoint Detection Challenge. The final ensemble model sets a new SOTA on this benchmark.

Results

TaskDatasetMetricValueModel
Pose EstimationCOCO test-devAP78.9CCM+
Pose EstimationCOCO test-devAP5093.8CCM+
Pose EstimationCOCO test-devAP7586CCM+
Pose EstimationCOCO test-devAPL84.5CCM+
Pose EstimationCOCO test-devAPM75CCM+
Pose EstimationCOCO test-devAR83.6CCM+
3DCOCO test-devAP78.9CCM+
3DCOCO test-devAP5093.8CCM+
3DCOCO test-devAP7586CCM+
3DCOCO test-devAPL84.5CCM+
3DCOCO test-devAPM75CCM+
3DCOCO test-devAR83.6CCM+
1 Image, 2*2 StitchiCOCO test-devAP78.9CCM+
1 Image, 2*2 StitchiCOCO test-devAP5093.8CCM+
1 Image, 2*2 StitchiCOCO test-devAP7586CCM+
1 Image, 2*2 StitchiCOCO test-devAPL84.5CCM+
1 Image, 2*2 StitchiCOCO test-devAPM75CCM+
1 Image, 2*2 StitchiCOCO test-devAR83.6CCM+

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16