TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Convolutional Pose Machines

Convolutional Pose Machines

Shih-En Wei, Varun Ramakrishna, Takeo Kanade, Yaser Sheikh

2016-01-30CVPR 2016 63D Human Pose EstimationStructured PredictionCar Pose EstimationPose EstimationClassification
PaperPDFCodeCodeCode(official)CodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

Pose Machines provide a sequential prediction framework for learning rich implicit spatial models. In this work we show a systematic design for how convolutional networks can be incorporated into the pose machine framework for learning image features and image-dependent spatial models for the task of pose estimation. The contribution of this paper is to implicitly model long-range dependencies between variables in structured prediction tasks such as articulated pose estimation. We achieve this by designing a sequential architecture composed of convolutional networks that directly operate on belief maps from previous stages, producing increasingly refined estimates for part locations, without the need for explicit graphical model-style inference. Our approach addresses the characteristic difficulty of vanishing gradients during training by providing a natural learning objective function that enforces intermediate supervision, thereby replenishing back-propagated gradients and conditioning the learning procedure. We demonstrate state-of-the-art performance and outperform competing methods on standard benchmarks including the MPII, LSP, and FLIC datasets.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationTotal CaptureAverage MPJPE (mm)99Tri-CPM
Pose EstimationJ-HMDBMean PCK@0.291.9CPM
Pose EstimationMPII Human PosePCKh-0.588.52Convolutional Pose Machines
Pose EstimationTotal CaptureAverage MPJPE (mm)99Tri-CPM
Pose EstimationApolloCar3DDetection Rate75.4CPM
3DJ-HMDBMean PCK@0.291.9CPM
3DMPII Human PosePCKh-0.588.52Convolutional Pose Machines
3DTotal CaptureAverage MPJPE (mm)99Tri-CPM
3DApolloCar3DDetection Rate75.4CPM
ClassificationRSSCN71:1 Accuracy50CPM
1 Image, 2*2 StitchiJ-HMDBMean PCK@0.291.9CPM
1 Image, 2*2 StitchiMPII Human PosePCKh-0.588.52Convolutional Pose Machines
1 Image, 2*2 StitchiTotal CaptureAverage MPJPE (mm)99Tri-CPM
1 Image, 2*2 StitchiApolloCar3DDetection Rate75.4CPM

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16