TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Single-Stage Multi-Person Pose Machines

Single-Stage Multi-Person Pose Machines

Xuecheng Nie, Jianfeng Zhang, Shuicheng Yan, Jiashi Feng

2019-08-24ICCV 2019 10Pose EstimationMulti-Person Pose EstimationKeypoint Detection3D Pose Estimation
PaperPDFCode

Abstract

Multi-person pose estimation is a challenging problem. Existing methods are mostly two-stage based--one stage for proposal generation and the other for allocating poses to corresponding persons. However, such two-stage methods generally suffer low efficiency. In this work, we present the first single-stage model, Single-stage multi-person Pose Machine (SPM), to simplify the pipeline and lift the efficiency for multi-person pose estimation. To achieve this, we propose a novel Structured Pose Representation (SPR) that unifies person instance and body joint position representations. Based on SPR, we develop the SPM model that can directly predict structured poses for multiple persons in a single stage, and thus offer a more compact pipeline and attractive efficiency advantage over two-stage methods. In particular, SPR introduces the root joints to indicate different person instances and human body joint positions are encoded into their displacements w.r.t. the roots. To better predict long-range displacements for some joints, SPR is further extended to hierarchical representations. Based on SPR, SPM can efficiently perform multi-person poses estimation by simultaneously predicting root joints (location of instances) and body joint displacements via CNNs. Moreover, to demonstrate the generality of SPM, we also apply it to multi-person 3D pose estimation. Comprehensive experiments on benchmarks MPII, extended PASCAL-Person-Part, MSCOCO and CMU Panoptic clearly demonstrate the state-of-the-art efficiency of SPM for multi-person 2D/3D pose estimation, together with outstanding accuracy.

Results

TaskDatasetMetricValueModel
Pose EstimationCOCO test-devAP66.9SPM
Pose EstimationCOCO test-devAP5088.5SPM
Pose EstimationCOCO test-devAP7572.9SPM
Pose EstimationCOCO test-devAPL73.1SPM
Pose EstimationCOCO test-devAPM62.6SPM
Pose EstimationCrowdPoseAP Easy70.3SPM
Pose EstimationCrowdPoseAP Hard55.7SPM
Pose EstimationCrowdPoseAP Medium64.5SPM
Pose EstimationCrowdPosemAP @0.5:0.9563.7SPM
Pose EstimationOCHumanAP5067.5SPM
Pose EstimationOCHumanAP7553.2SPM
Pose EstimationOCHumanValidation AP47.6SPM
3DCOCO test-devAP66.9SPM
3DCOCO test-devAP5088.5SPM
3DCOCO test-devAP7572.9SPM
3DCOCO test-devAPL73.1SPM
3DCOCO test-devAPM62.6SPM
3DCrowdPoseAP Easy70.3SPM
3DCrowdPoseAP Hard55.7SPM
3DCrowdPoseAP Medium64.5SPM
3DCrowdPosemAP @0.5:0.9563.7SPM
3DOCHumanAP5067.5SPM
3DOCHumanAP7553.2SPM
3DOCHumanValidation AP47.6SPM
Multi-Person Pose EstimationCOCO test-devAP66.9SPM
Multi-Person Pose EstimationCOCO test-devAP5088.5SPM
Multi-Person Pose EstimationCOCO test-devAP7572.9SPM
Multi-Person Pose EstimationCOCO test-devAPL73.1SPM
Multi-Person Pose EstimationCOCO test-devAPM62.6SPM
Multi-Person Pose EstimationCrowdPoseAP Easy70.3SPM
Multi-Person Pose EstimationCrowdPoseAP Hard55.7SPM
Multi-Person Pose EstimationCrowdPoseAP Medium64.5SPM
Multi-Person Pose EstimationCrowdPosemAP @0.5:0.9563.7SPM
Multi-Person Pose EstimationOCHumanAP5067.5SPM
Multi-Person Pose EstimationOCHumanAP7553.2SPM
Multi-Person Pose EstimationOCHumanValidation AP47.6SPM
1 Image, 2*2 StitchiCOCO test-devAP66.9SPM
1 Image, 2*2 StitchiCOCO test-devAP5088.5SPM
1 Image, 2*2 StitchiCOCO test-devAP7572.9SPM
1 Image, 2*2 StitchiCOCO test-devAPL73.1SPM
1 Image, 2*2 StitchiCOCO test-devAPM62.6SPM
1 Image, 2*2 StitchiCrowdPoseAP Easy70.3SPM
1 Image, 2*2 StitchiCrowdPoseAP Hard55.7SPM
1 Image, 2*2 StitchiCrowdPoseAP Medium64.5SPM
1 Image, 2*2 StitchiCrowdPosemAP @0.5:0.9563.7SPM
1 Image, 2*2 StitchiOCHumanAP5067.5SPM
1 Image, 2*2 StitchiOCHumanAP7553.2SPM
1 Image, 2*2 StitchiOCHumanValidation AP47.6SPM

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16