TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Learning Delicate Local Representations for Multi-Person P...

Learning Delicate Local Representations for Multi-Person Pose Estimation

Yuanhao Cai, Zhicheng Wang, Zhengxiong Luo, Binyi Yin, Angang Du, Haoqian Wang, Xiangyu Zhang, Xinyu Zhou, Erjin Zhou, Jian Sun

2020-03-09ECCV 2020 8Pose EstimationMulti-Person Pose EstimationKeypoint Detection
PaperPDFCodeCodeCodeCode(official)

Abstract

In this paper, we propose a novel method called Residual Steps Network (RSN). RSN aggregates features with the same spatial size (Intra-level features) efficiently to obtain delicate local representations, which retain rich low-level spatial information and result in precise keypoint localization. Additionally, we observe the output features contribute differently to final performance. To tackle this problem, we propose an efficient attention mechanism - Pose Refine Machine (PRM) to make a trade-off between local and global representations in output features and further refine the keypoint locations. Our approach won the 1st place of COCO Keypoint Challenge 2019 and achieves state-of-the-art results on both COCO and MPII benchmarks, without using extra training data and pretrained model. Our single model achieves 78.6 on COCO test-dev, 93.0 on MPII test dataset. Ensembled models achieve 79.2 on COCO test-dev, 77.1 on COCO test-challenge dataset. The source code is publicly available for further research at https://github.com/caiyuanhao1998/RSN/

Results

TaskDatasetMetricValueModel
Pose EstimationCOCO test-devAP79.24xRSN-50 (ensemble)
Pose EstimationCOCO test-devAP5094.44xRSN-50 (ensemble)
Pose EstimationCOCO test-devAP7587.14xRSN-50 (ensemble)
Pose EstimationCOCO test-devAPL76.14xRSN-50 (ensemble)
Pose EstimationCOCO test-devAPM83.84xRSN-50 (ensemble)
Pose EstimationCOCO test-devAR84.14xRSN-50 (ensemble)
Pose EstimationCOCO test-devAP78.64xRSN-50
Pose EstimationCOCO test-devAP5094.34xRSN-50
Pose EstimationCOCO test-devAP7586.64xRSN-50
Pose EstimationCOCO test-devAPL75.54xRSN-50
Pose EstimationCOCO test-devAPM83.34xRSN-50
Pose EstimationCOCO test-devAR83.84xRSN-50
Pose EstimationMPII Human PosePCKh-0.5934xRSN-50
Pose EstimationMPII Single PersonPCKh@0.5934xRSN-50
Pose EstimationCOCO (Common Objects in Context)Test AP78.64xRSN-50(384×288)
Pose EstimationCOCO test-challengeAP77.14×RSN-50
Pose EstimationCOCO test-challengeAP5093.34×RSN-50
Pose EstimationCOCO test-challengeAP7583.64×RSN-50
Pose EstimationCOCO test-challengeAPL82.64×RSN-50
Pose EstimationCOCO test-challengeAR82.64×RSN-50
Pose EstimationCOCO test-challengeAR5096.14×RSN-50
Pose EstimationCOCO test-challengeAR7588.24×RSN-50
Pose EstimationCOCO test-challengeARL88.74×RSN-50
Pose EstimationCOCO test-challengeARM784×RSN-50
Pose EstimationCOCO (Common Objects in Context)AP0.792RSN
3DCOCO test-devAP79.24xRSN-50 (ensemble)
3DCOCO test-devAP5094.44xRSN-50 (ensemble)
3DCOCO test-devAP7587.14xRSN-50 (ensemble)
3DCOCO test-devAPL76.14xRSN-50 (ensemble)
3DCOCO test-devAPM83.84xRSN-50 (ensemble)
3DCOCO test-devAR84.14xRSN-50 (ensemble)
3DCOCO test-devAP78.64xRSN-50
3DCOCO test-devAP5094.34xRSN-50
3DCOCO test-devAP7586.64xRSN-50
3DCOCO test-devAPL75.54xRSN-50
3DCOCO test-devAPM83.34xRSN-50
3DCOCO test-devAR83.84xRSN-50
3DMPII Human PosePCKh-0.5934xRSN-50
3DMPII Single PersonPCKh@0.5934xRSN-50
3DCOCO (Common Objects in Context)Test AP78.64xRSN-50(384×288)
3DCOCO test-challengeAP77.14×RSN-50
3DCOCO test-challengeAP5093.34×RSN-50
3DCOCO test-challengeAP7583.64×RSN-50
3DCOCO test-challengeAPL82.64×RSN-50
3DCOCO test-challengeAR82.64×RSN-50
3DCOCO test-challengeAR5096.14×RSN-50
3DCOCO test-challengeAR7588.24×RSN-50
3DCOCO test-challengeARL88.74×RSN-50
3DCOCO test-challengeARM784×RSN-50
3DCOCO (Common Objects in Context)AP0.792RSN
Multi-Person Pose EstimationCOCO (Common Objects in Context)AP0.792RSN
1 Image, 2*2 StitchiCOCO test-devAP79.24xRSN-50 (ensemble)
1 Image, 2*2 StitchiCOCO test-devAP5094.44xRSN-50 (ensemble)
1 Image, 2*2 StitchiCOCO test-devAP7587.14xRSN-50 (ensemble)
1 Image, 2*2 StitchiCOCO test-devAPL76.14xRSN-50 (ensemble)
1 Image, 2*2 StitchiCOCO test-devAPM83.84xRSN-50 (ensemble)
1 Image, 2*2 StitchiCOCO test-devAR84.14xRSN-50 (ensemble)
1 Image, 2*2 StitchiCOCO test-devAP78.64xRSN-50
1 Image, 2*2 StitchiCOCO test-devAP5094.34xRSN-50
1 Image, 2*2 StitchiCOCO test-devAP7586.64xRSN-50
1 Image, 2*2 StitchiCOCO test-devAPL75.54xRSN-50
1 Image, 2*2 StitchiCOCO test-devAPM83.34xRSN-50
1 Image, 2*2 StitchiCOCO test-devAR83.84xRSN-50
1 Image, 2*2 StitchiMPII Human PosePCKh-0.5934xRSN-50
1 Image, 2*2 StitchiMPII Single PersonPCKh@0.5934xRSN-50
1 Image, 2*2 StitchiCOCO (Common Objects in Context)Test AP78.64xRSN-50(384×288)
1 Image, 2*2 StitchiCOCO test-challengeAP77.14×RSN-50
1 Image, 2*2 StitchiCOCO test-challengeAP5093.34×RSN-50
1 Image, 2*2 StitchiCOCO test-challengeAP7583.64×RSN-50
1 Image, 2*2 StitchiCOCO test-challengeAPL82.64×RSN-50
1 Image, 2*2 StitchiCOCO test-challengeAR82.64×RSN-50
1 Image, 2*2 StitchiCOCO test-challengeAR5096.14×RSN-50
1 Image, 2*2 StitchiCOCO test-challengeAR7588.24×RSN-50
1 Image, 2*2 StitchiCOCO test-challengeARL88.74×RSN-50
1 Image, 2*2 StitchiCOCO test-challengeARM784×RSN-50
1 Image, 2*2 StitchiCOCO (Common Objects in Context)AP0.792RSN

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16