TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/VIBE: Video Inference for Human Body Pose and Shape Estima...

VIBE: Video Inference for Human Body Pose and Shape Estimation

Muhammed Kocabas, Nikos Athanasiou, Michael J. Black

2019-12-11CVPR 2020 63D Human Pose Estimation3D Shape ReconstructionMonocular 3D Human Pose EstimationPose Estimation3D Pose Estimation
PaperPDFCodeCodeCodeCode(official)Code

Abstract

Human motion is fundamental to understanding behavior. Despite progress on single-image 3D pose and shape estimation, existing video-based state-of-the-art methods fail to produce accurate and natural motion sequences due to a lack of ground-truth 3D motion data for training. To address this problem, we propose Video Inference for Body Pose and Shape Estimation (VIBE), which makes use of an existing large-scale motion capture dataset (AMASS) together with unpaired, in-the-wild, 2D keypoint annotations. Our key novelty is an adversarial learning framework that leverages AMASS to discriminate between real human motions and those produced by our temporal pose and shape regression networks. We define a temporal network architecture and show that adversarial training, at the sequence level, produces kinematically plausible motion sequences without in-the-wild ground-truth 3D labels. We perform extensive experimentation to analyze the importance of motion and demonstrate the effectiveness of VIBE on challenging 3D pose estimation datasets, achieving state-of-the-art performance. Code and pretrained models are available at https://github.com/mkocabas/VIBE.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationMPI-INF-3DHPMPJPE96.6VIBE
3D Human Pose EstimationMPI-INF-3DHPPA-MPJPE64.6VIBE
3D Human Pose EstimationMPI-INF-3DHPPCK89.3VIBE
3D Human Pose Estimation3DPWAcceleration Error23.4VIBE
3D Human Pose Estimation3DPWMPJPE82.9VIBE
3D Human Pose Estimation3DPWMPVPE99.1VIBE
3D Human Pose Estimation3DPWNumber of parameters (M)72.43VIBE
3D Human Pose Estimation3DPWPA-MPJPE51.9VIBE
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)65.6VIBE
3D Human Pose EstimationHuman3.6MPA-MPJPE41.4VIBE
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)65.6VIBE
3D Human Pose EstimationHuman3.6MFrames Needed16VIBE
Pose EstimationMPI-INF-3DHPMPJPE96.6VIBE
Pose EstimationMPI-INF-3DHPPA-MPJPE64.6VIBE
Pose EstimationMPI-INF-3DHPPCK89.3VIBE
Pose Estimation3DPWAcceleration Error23.4VIBE
Pose Estimation3DPWMPJPE82.9VIBE
Pose Estimation3DPWMPVPE99.1VIBE
Pose Estimation3DPWNumber of parameters (M)72.43VIBE
Pose Estimation3DPWPA-MPJPE51.9VIBE
Pose EstimationHuman3.6MAverage MPJPE (mm)65.6VIBE
Pose EstimationHuman3.6MPA-MPJPE41.4VIBE
Pose EstimationHuman3.6MAverage MPJPE (mm)65.6VIBE
Pose EstimationHuman3.6MFrames Needed16VIBE
3DMPI-INF-3DHPMPJPE96.6VIBE
3DMPI-INF-3DHPPA-MPJPE64.6VIBE
3DMPI-INF-3DHPPCK89.3VIBE
3D3DPWAcceleration Error23.4VIBE
3D3DPWMPJPE82.9VIBE
3D3DPWMPVPE99.1VIBE
3D3DPWNumber of parameters (M)72.43VIBE
3D3DPWPA-MPJPE51.9VIBE
3DHuman3.6MAverage MPJPE (mm)65.6VIBE
3DHuman3.6MPA-MPJPE41.4VIBE
3DHuman3.6MAverage MPJPE (mm)65.6VIBE
3DHuman3.6MFrames Needed16VIBE
1 Image, 2*2 StitchiMPI-INF-3DHPMPJPE96.6VIBE
1 Image, 2*2 StitchiMPI-INF-3DHPPA-MPJPE64.6VIBE
1 Image, 2*2 StitchiMPI-INF-3DHPPCK89.3VIBE
1 Image, 2*2 Stitchi3DPWAcceleration Error23.4VIBE
1 Image, 2*2 Stitchi3DPWMPJPE82.9VIBE
1 Image, 2*2 Stitchi3DPWMPVPE99.1VIBE
1 Image, 2*2 Stitchi3DPWNumber of parameters (M)72.43VIBE
1 Image, 2*2 Stitchi3DPWPA-MPJPE51.9VIBE
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)65.6VIBE
1 Image, 2*2 StitchiHuman3.6MPA-MPJPE41.4VIBE
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)65.6VIBE
1 Image, 2*2 StitchiHuman3.6MFrames Needed16VIBE

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16