TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/TriPose: A Weakly-Supervised 3D Human Pose Estimation via ...

TriPose: A Weakly-Supervised 3D Human Pose Estimation via Triangulation from Video

Mohsen Gholami, Ahmad Rezaei, Helge Rhodin, Rabab Ward, Z. Jane Wang

2021-05-143D Human Pose EstimationWeakly-supervised 3D Human Pose EstimationPose Estimation
PaperPDF

Abstract

Estimating 3D human poses from video is a challenging problem. The lack of 3D human pose annotations is a major obstacle for supervised training and for generalization to unseen datasets. In this work, we address this problem by proposing a weakly-supervised training scheme that does not require 3D annotations or calibrated cameras. The proposed method relies on temporal information and triangulation. Using 2D poses from multiple views as the input, we first estimate the relative camera orientations and then generate 3D poses via triangulation. The triangulation is only applied to the views with high 2D human joint confidence. The generated 3D poses are then used to train a recurrent lifting network (RLN) that estimates 3D poses from 2D poses. We further apply a multi-view re-projection loss to the estimated 3D poses and enforce the 3D poses estimated from multi-views to be consistent. Therefore, our method relaxes the constraints in practice, only multi-view videos are required for training, and is thus convenient for in-the-wild settings. At inference, RLN merely requires single-view videos. The proposed method outperforms previous works on two challenging datasets, Human3.6M and MPI-INF-3DHP. Codes and pretrained models will be publicly available.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)62.9TriPose
3D Human Pose EstimationHuman3.6MNumber of Frames Per View27TriPose
3D Human Pose EstimationHuman3.6MNumber of Views1TriPose
Pose EstimationHuman3.6MAverage MPJPE (mm)62.9TriPose
Pose EstimationHuman3.6MNumber of Frames Per View27TriPose
Pose EstimationHuman3.6MNumber of Views1TriPose
3DHuman3.6MAverage MPJPE (mm)62.9TriPose
3DHuman3.6MNumber of Frames Per View27TriPose
3DHuman3.6MNumber of Views1TriPose
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)62.9TriPose
1 Image, 2*2 StitchiHuman3.6MNumber of Frames Per View27TriPose
1 Image, 2*2 StitchiHuman3.6MNumber of Views1TriPose

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16