TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/3D human pose estimation in video with temporal convolutio...

3D human pose estimation in video with temporal convolutions and semi-supervised training

Dario Pavllo, Christoph Feichtenhofer, David Grangier, Michael Auli

2018-11-28CVPR 2019 63D Human Pose EstimationWeakly-supervised 3D Human Pose EstimationMonocular 3D Human Pose EstimationPose Estimation
PaperPDFCodeCodeCode(official)CodeCodeCodeCodeCodeCodeCode

Abstract

In this work, we demonstrate that 3D poses in video can be effectively estimated with a fully convolutional model based on dilated temporal convolutions over 2D keypoints. We also introduce back-projection, a simple and effective semi-supervised training method that leverages unlabeled video data. We start with predicted 2D keypoints for unlabeled video, then estimate 3D poses and finally back-project to the input 2D keypoints. In the supervised setting, our fully-convolutional model outperforms the previous best result from the literature by 6 mm mean per-joint position error on Human3.6M, corresponding to an error reduction of 11%, and the model also shows significant improvements on HumanEva-I. Moreover, experiments with back-projection show that it comfortably outperforms previous state-of-the-art results in semi-supervised settings where labeled data is scarce. Code and models are available at https://github.com/facebookresearch/VideoPose3D

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)46.8VideoPose3D (T=243)
3D Human Pose EstimationHuman3.6MPA-MPJPE36.5VideoPose3D (T=243)
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)51.8VideoPose3D (T=1)
3D Human Pose EstimationHuman3.6MPA-MPJPE40VideoPose3D (T=1)
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)46.8VideoPose3D (T=243)
3D Human Pose EstimationHuman3.6MFrames Needed243VideoPose3D (T=243)
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)64.7Pavllo et al.
3D Human Pose EstimationHuman3.6MNumber of Views1Pavllo et al.
3D Human Pose EstimationHuman3.6MNumber of Frames Per View243VideoPose3D (T=243)
Pose EstimationHuman3.6MAverage MPJPE (mm)46.8VideoPose3D (T=243)
Pose EstimationHuman3.6MPA-MPJPE36.5VideoPose3D (T=243)
Pose EstimationHuman3.6MAverage MPJPE (mm)51.8VideoPose3D (T=1)
Pose EstimationHuman3.6MPA-MPJPE40VideoPose3D (T=1)
Pose EstimationHuman3.6MAverage MPJPE (mm)46.8VideoPose3D (T=243)
Pose EstimationHuman3.6MFrames Needed243VideoPose3D (T=243)
Pose EstimationHuman3.6MAverage MPJPE (mm)64.7Pavllo et al.
Pose EstimationHuman3.6MNumber of Views1Pavllo et al.
Pose EstimationHuman3.6MNumber of Frames Per View243VideoPose3D (T=243)
3DHuman3.6MAverage MPJPE (mm)46.8VideoPose3D (T=243)
3DHuman3.6MPA-MPJPE36.5VideoPose3D (T=243)
3DHuman3.6MAverage MPJPE (mm)51.8VideoPose3D (T=1)
3DHuman3.6MPA-MPJPE40VideoPose3D (T=1)
3DHuman3.6MAverage MPJPE (mm)46.8VideoPose3D (T=243)
3DHuman3.6MFrames Needed243VideoPose3D (T=243)
3DHuman3.6MAverage MPJPE (mm)64.7Pavllo et al.
3DHuman3.6MNumber of Views1Pavllo et al.
3DHuman3.6MNumber of Frames Per View243VideoPose3D (T=243)
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)46.8VideoPose3D (T=243)
1 Image, 2*2 StitchiHuman3.6MPA-MPJPE36.5VideoPose3D (T=243)
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)51.8VideoPose3D (T=1)
1 Image, 2*2 StitchiHuman3.6MPA-MPJPE40VideoPose3D (T=1)
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)46.8VideoPose3D (T=243)
1 Image, 2*2 StitchiHuman3.6MFrames Needed243VideoPose3D (T=243)
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)64.7Pavllo et al.
1 Image, 2*2 StitchiHuman3.6MNumber of Views1Pavllo et al.
1 Image, 2*2 StitchiHuman3.6MNumber of Frames Per View243VideoPose3D (T=243)

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16