TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Self-Constrained Inference Optimization on Structural Grou...

Self-Constrained Inference Optimization on Structural Groups for Human Pose Estimation

Zhehan Kan, Shuoshuo Chen, Zeng Li, Zhihai He

2022-07-06PredictionPose EstimationMulti-Person Pose EstimationPose Prediction
PaperPDF

Abstract

We observe that human poses exhibit strong group-wise structural correlation and spatial coupling between keypoints due to the biological constraints of different body parts. This group-wise structural correlation can be explored to improve the accuracy and robustness of human pose estimation. In this work, we develop a self-constrained prediction-verification network to characterize and learn the structural correlation between keypoints during training. During the inference stage, the feedback information from the verification network allows us to perform further optimization of pose prediction, which significantly improves the performance of human pose estimation. Specifically, we partition the keypoints into groups according to the biological structure of human body. Within each group, the keypoints are further partitioned into two subsets, high-confidence base keypoints and low-confidence terminal keypoints. We develop a self-constrained prediction-verification network to perform forward and backward predictions between these keypoint subsets. One fundamental challenge in pose estimation, as well as in generic prediction tasks, is that there is no mechanism for us to verify if the obtained pose estimation or prediction results are accurate or not, since the ground truth is not available. Once successfully learned, the verification network serves as an accuracy verification module for the forward pose prediction. During the inference stage, it can be used to guide the local optimization of the pose estimation results of low-confidence keypoints with the self-constrained loss on high-confidence keypoints as the objective function. Our extensive experimental results on benchmark MS COCO and CrowdPose datasets demonstrate that the proposed method can significantly improve the pose estimation results.

Results

TaskDatasetMetricValueModel
Pose EstimationCOCO test-devAP79.2SCIO (HRNet-48)
Pose EstimationCOCO test-devAP5093.5SCIO (HRNet-48)
Pose EstimationCOCO test-devAP7585.8SCIO (HRNet-48)
Pose EstimationCOCO test-devAPL84.2SCIO (HRNet-48)
Pose EstimationCOCO test-devAPM74.1SCIO (HRNet-48)
Pose EstimationCOCO test-devAR81.6SCIO (HRNet-48)
Pose EstimationCrowdPoseAP Medium72.2SCIO (HRNet-48)
Pose EstimationCrowdPosemAP @0.5:0.9571.5SCIO (HRNet-48)
3DCOCO test-devAP79.2SCIO (HRNet-48)
3DCOCO test-devAP5093.5SCIO (HRNet-48)
3DCOCO test-devAP7585.8SCIO (HRNet-48)
3DCOCO test-devAPL84.2SCIO (HRNet-48)
3DCOCO test-devAPM74.1SCIO (HRNet-48)
3DCOCO test-devAR81.6SCIO (HRNet-48)
3DCrowdPoseAP Medium72.2SCIO (HRNet-48)
3DCrowdPosemAP @0.5:0.9571.5SCIO (HRNet-48)
Multi-Person Pose EstimationCOCO test-devAP79.2SCIO (HRNet-48)
Multi-Person Pose EstimationCOCO test-devAP5093.5SCIO (HRNet-48)
Multi-Person Pose EstimationCOCO test-devAP7585.8SCIO (HRNet-48)
Multi-Person Pose EstimationCOCO test-devAPL84.2SCIO (HRNet-48)
Multi-Person Pose EstimationCOCO test-devAPM74.1SCIO (HRNet-48)
Multi-Person Pose EstimationCOCO test-devAR81.6SCIO (HRNet-48)
Multi-Person Pose EstimationCrowdPoseAP Medium72.2SCIO (HRNet-48)
Multi-Person Pose EstimationCrowdPosemAP @0.5:0.9571.5SCIO (HRNet-48)
1 Image, 2*2 StitchiCOCO test-devAP79.2SCIO (HRNet-48)
1 Image, 2*2 StitchiCOCO test-devAP5093.5SCIO (HRNet-48)
1 Image, 2*2 StitchiCOCO test-devAP7585.8SCIO (HRNet-48)
1 Image, 2*2 StitchiCOCO test-devAPL84.2SCIO (HRNet-48)
1 Image, 2*2 StitchiCOCO test-devAPM74.1SCIO (HRNet-48)
1 Image, 2*2 StitchiCOCO test-devAR81.6SCIO (HRNet-48)
1 Image, 2*2 StitchiCrowdPoseAP Medium72.2SCIO (HRNet-48)
1 Image, 2*2 StitchiCrowdPosemAP @0.5:0.9571.5SCIO (HRNet-48)

Related Papers

Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction2025-07-21$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16