TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/A generic diffusion-based approach for 3D human pose predi...

A generic diffusion-based approach for 3D human pose prediction in the wild

Saeed Saadatnejad, Ali Rasekh, Mohammadreza Mofayezi, Yasamin Medghalchi, Sara Rajabzadeh, Taylor Mordan, Alexandre Alahi

2022-10-11DenoisingHuman Pose ForecastingPredictionPose PredictionMissing Elements
PaperPDFCode(official)

Abstract

Predicting 3D human poses in real-world scenarios, also known as human pose forecasting, is inevitably subject to noisy inputs arising from inaccurate 3D pose estimations and occlusions. To address these challenges, we propose a diffusion-based approach that can predict given noisy observations. We frame the prediction task as a denoising problem, where both observation and prediction are considered as a single sequence containing missing elements (whether in the observation or prediction horizon). All missing elements are treated as noise and denoised with our conditional diffusion model. To better handle long-term forecasting horizon, we present a temporal cascaded diffusion model. We demonstrate the benefits of our approach on four publicly available datasets (Human3.6M, HumanEva-I, AMASS, and 3DPW), outperforming the state-of-the-art. Additionally, we show that our framework is generic enough to improve any 3D pose prediction model as a pre-processing step to repair their inputs and a post-processing step to refine their outputs. The code is available online: \url{https://github.com/vita-epfl/DePOSit}.

Results

TaskDatasetMetricValueModel
Pose EstimationAMASSFDE@1000ms (mm)66.7TCD
Pose EstimationAMASSFDE@560ms (mm)49.8TCD
Pose EstimationAMASSFDE@720ms (mm)54.5TCD
Pose EstimationAMASSFDE@880ms (mm)60.1TCD
Pose EstimationHuman3.6MADE356TCD
Pose EstimationHuman3.6MAPD19466TCD
Pose EstimationHuman3.6MFDE396TCD
Pose EstimationHuman3.6MMMADE463TCD
Pose EstimationHuman3.6MMMFDE445TCD
Pose EstimationHumanEva-IADE@2000ms199TCD
Pose EstimationHumanEva-IAPD@2000ms6764TCD
Pose EstimationHumanEva-IFDE@2000ms215TCD
Pose Estimation3DPWFDE@1000ms (mm)73.4TCD
Pose Estimation3DPWFDE@560ms (mm)55.4TCD
Pose Estimation3DPWFDE@720ms (mm)61.6TCD
Pose Estimation3DPWFDE@880ms (mm)67.9TCD
3DAMASSFDE@1000ms (mm)66.7TCD
3DAMASSFDE@560ms (mm)49.8TCD
3DAMASSFDE@720ms (mm)54.5TCD
3DAMASSFDE@880ms (mm)60.1TCD
3DHuman3.6MADE356TCD
3DHuman3.6MAPD19466TCD
3DHuman3.6MFDE396TCD
3DHuman3.6MMMADE463TCD
3DHuman3.6MMMFDE445TCD
3DHumanEva-IADE@2000ms199TCD
3DHumanEva-IAPD@2000ms6764TCD
3DHumanEva-IFDE@2000ms215TCD
3D3DPWFDE@1000ms (mm)73.4TCD
3D3DPWFDE@560ms (mm)55.4TCD
3D3DPWFDE@720ms (mm)61.6TCD
3D3DPWFDE@880ms (mm)67.9TCD
1 Image, 2*2 StitchiAMASSFDE@1000ms (mm)66.7TCD
1 Image, 2*2 StitchiAMASSFDE@560ms (mm)49.8TCD
1 Image, 2*2 StitchiAMASSFDE@720ms (mm)54.5TCD
1 Image, 2*2 StitchiAMASSFDE@880ms (mm)60.1TCD
1 Image, 2*2 StitchiHuman3.6MADE356TCD
1 Image, 2*2 StitchiHuman3.6MAPD19466TCD
1 Image, 2*2 StitchiHuman3.6MFDE396TCD
1 Image, 2*2 StitchiHuman3.6MMMADE463TCD
1 Image, 2*2 StitchiHuman3.6MMMFDE445TCD
1 Image, 2*2 StitchiHumanEva-IADE@2000ms199TCD
1 Image, 2*2 StitchiHumanEva-IAPD@2000ms6764TCD
1 Image, 2*2 StitchiHumanEva-IFDE@2000ms215TCD
1 Image, 2*2 Stitchi3DPWFDE@1000ms (mm)73.4TCD
1 Image, 2*2 Stitchi3DPWFDE@560ms (mm)55.4TCD
1 Image, 2*2 Stitchi3DPWFDE@720ms (mm)61.6TCD
1 Image, 2*2 Stitchi3DPWFDE@880ms (mm)67.9TCD

Related Papers

Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction2025-07-21fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16HUG-VAS: A Hierarchical NURBS-Based Generative Model for Aortic Geometry Synthesis and Controllable Editing2025-07-15AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air2025-07-15Generative Click-through Rate Prediction with Applications to Search Advertising2025-07-15Conformation-Aware Structure Prediction of Antigen-Recognizing Immune Proteins2025-07-11