TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Self-Supervised Learning of 3D Human Pose using Multi-view...

Self-Supervised Learning of 3D Human Pose using Multi-view Geometry

Muhammed Kocabas, Salih Karagoz, Emre Akbas

2019-03-06CVPR 2019 63D Human Pose EstimationWeakly-supervised 3D Human Pose EstimationSelf-Supervised LearningPose Estimation
PaperPDFCode(official)

Abstract

Training accurate 3D human pose estimators requires large amount of 3D ground-truth data which is costly to collect. Various weakly or self supervised pose estimation methods have been proposed due to lack of 3D data. Nevertheless, these methods, in addition to 2D ground-truth poses, require either additional supervision in various forms (e.g. unpaired 3D ground truth data, a small subset of labels) or the camera parameters in multiview settings. To address these problems, we present EpipolarPose, a self-supervised learning method for 3D human pose estimation, which does not need any 3D ground-truth data or camera extrinsics. During training, EpipolarPose estimates 2D poses from multi-view images, and then, utilizes epipolar geometry to obtain a 3D pose and camera geometry which are subsequently used to train a 3D pose estimator. We demonstrate the effectiveness of our approach on standard benchmark datasets i.e. Human3.6M and MPI-INF-3DHP where we set the new state-of-the-art among weakly/self-supervised methods. Furthermore, we propose a new performance measure Pose Structure Score (PSS) which is a scale invariant, structure aware measure to evaluate the structural plausibility of a pose with respect to its ground truth. Code and pretrained models are available at https://github.com/mkocabas/EpipolarPose

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationMPI-INF-3DHPMPJPE108.99EpipolarPose (fully-supervised)
3D Human Pose EstimationMPI-INF-3DHPPCK77.5EpipolarPose (fully-supervised)
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)60.56EpipolarPose (SS + RU)
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)65.35EpipolarPose (S1)
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)76.6EpipolarPose (self-supervised)
3D Human Pose EstimationHuman3.6MNumber of Frames Per View1Kocabas et al.
3D Human Pose EstimationHuman3.6MNumber of Views2Kocabas et al.
Pose EstimationMPI-INF-3DHPMPJPE108.99EpipolarPose (fully-supervised)
Pose EstimationMPI-INF-3DHPPCK77.5EpipolarPose (fully-supervised)
Pose EstimationHuman3.6MAverage MPJPE (mm)60.56EpipolarPose (SS + RU)
Pose EstimationHuman3.6MAverage MPJPE (mm)65.35EpipolarPose (S1)
Pose EstimationHuman3.6MAverage MPJPE (mm)76.6EpipolarPose (self-supervised)
Pose EstimationHuman3.6MNumber of Frames Per View1Kocabas et al.
Pose EstimationHuman3.6MNumber of Views2Kocabas et al.
3DMPI-INF-3DHPMPJPE108.99EpipolarPose (fully-supervised)
3DMPI-INF-3DHPPCK77.5EpipolarPose (fully-supervised)
3DHuman3.6MAverage MPJPE (mm)60.56EpipolarPose (SS + RU)
3DHuman3.6MAverage MPJPE (mm)65.35EpipolarPose (S1)
3DHuman3.6MAverage MPJPE (mm)76.6EpipolarPose (self-supervised)
3DHuman3.6MNumber of Frames Per View1Kocabas et al.
3DHuman3.6MNumber of Views2Kocabas et al.
1 Image, 2*2 StitchiMPI-INF-3DHPMPJPE108.99EpipolarPose (fully-supervised)
1 Image, 2*2 StitchiMPI-INF-3DHPPCK77.5EpipolarPose (fully-supervised)
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)60.56EpipolarPose (SS + RU)
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)65.35EpipolarPose (S1)
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)76.6EpipolarPose (self-supervised)
1 Image, 2*2 StitchiHuman3.6MNumber of Frames Per View1Kocabas et al.
1 Image, 2*2 StitchiHuman3.6MNumber of Views2Kocabas et al.

Related Papers

A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16