Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

HP-GAN: Probabilistic 3D human motion prediction via GAN

Emad Barsoum, John Kender, Zicheng Liu

2017-11-27
Autonomous Vehicles, Human Pose Forecasting, Synthetic Data Generation, Motion Estimation, Human motion prediction, motion prediction, Prediction, Motion Synthesis

Abstract

Predicting and understanding human motion dynamics has many applications, such as motion synthesis, augmented reality, security, and autonomous vehicles. Due to the recent success of generative adversarial networks (GANs), there has been much interest in probabilistic estimation and synthetic data generation using deep neural network architectures and learning algorithms. We propose a novel sequence-to-sequence model for probabilistic human motion prediction, trained with a modified version of the improved Wasserstein generative adversarial network (WGAN-GP), in which we use a custom loss function designed for human motion prediction. Our model, which we call HP-GAN, learns a probability density function of future human poses conditioned on previous poses. It predicts multiple sequences of possible future human poses, each from the same input sequence but a different vector z drawn from a random distribution. Furthermore, to quantify the quality of the non-deterministic predictions, we simultaneously train a motion-quality-assessment model that learns the probability that a given skeleton sequence is a real human motion. We test our algorithm on two of the largest skeleton datasets: NTU RGB+D and Human3.6M. We train our model on both single and multiple action types. Its predictive power for long-term motion estimation is demonstrated by generating multiple plausible futures of more than 30 frames from just 10 frames of input. We show that most sequences generated from the same input have a greater than 50% probability of being judged as a real human sequence. We will release all the code used in this paper on GitHub.
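The sampling scheme described in the abstract (one observed sequence, several noise vectors z, one predicted future per z) can be illustrated with a small sketch. Everything here is an assumption made for illustration: `toy_generator` is a hypothetical stand-in that extrapolates the last observed velocity and is not the HP-GAN network, poses are flat lists of coordinates, and `fraction_judged_real` simply counts how many hypothetical quality scores exceed 0.5. Only the 10-frame input and 30-frame output lengths come from the abstract.

```python
import random

PAST_FRAMES = 10    # input length mentioned in the abstract
FUTURE_FRAMES = 30  # output length mentioned in the abstract


def toy_generator(past, z):
    """Hypothetical stand-in for a trained generator G(past, z).

    Extrapolates the last observed velocity and adds a small drift
    controlled by z, so different z values give different futures.
    """
    last, prev = past[-1], past[-2]
    velocity = [a - b for a, b in zip(last, prev)]
    future, pose = [], last
    for _ in range(FUTURE_FRAMES):
        pose = [p + v + 0.01 * zi for p, v, zi in zip(pose, velocity, z)]
        future.append(pose)
    return future


def sample_futures(past, num_samples, rng):
    """Draw one Gaussian z per sample and predict one future for each."""
    dim = len(past[0])
    futures = []
    for _ in range(num_samples):
        z = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        futures.append(toy_generator(past, z))
    return futures


def fraction_judged_real(probabilities, threshold=0.5):
    """Share of samples whose 'real motion' probability exceeds the threshold."""
    return sum(p > threshold for p in probabilities) / len(probabilities)


# Usage: a 6-dimensional pose moving at constant velocity for 10 frames.
past = [[0.1 * t] * 6 for t in range(PAST_FRAMES)]
futures = sample_futures(past, num_samples=5, rng=random.Random(0))
```

In the paper's setup the probabilities passed to `fraction_judged_real` would come from the separately trained motion-quality-assessment model; here they are left as plain inputs.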

Results

Task                  Dataset     Metric      Value  Model
Pose Estimation       Human3.6M   ADE         858    HP-GAN
Pose Estimation       Human3.6M   APD         7214   HP-GAN
Pose Estimation       Human3.6M   FDE         867    HP-GAN
Pose Estimation       Human3.6M   MMADE       847    HP-GAN
Pose Estimation       Human3.6M   MMFDE       858    HP-GAN
Pose Estimation       HumanEva-I  ADE@2000ms  772    HP-GAN
Pose Estimation       HumanEva-I  APD@2000ms  1139   HP-GAN
Pose Estimation       HumanEva-I  FDE@2000ms  749    HP-GAN
3D                    Human3.6M   ADE         858    HP-GAN
3D                    Human3.6M   APD         7214   HP-GAN
3D                    Human3.6M   FDE         867    HP-GAN
3D                    Human3.6M   MMADE       847    HP-GAN
3D                    Human3.6M   MMFDE       858    HP-GAN
3D                    HumanEva-I  ADE@2000ms  772    HP-GAN
3D                    HumanEva-I  APD@2000ms  1139   HP-GAN
3D                    HumanEva-I  FDE@2000ms  749    HP-GAN
1 Image, 2*2 Stitchi  Human3.6M   ADE         858    HP-GAN
1 Image, 2*2 Stitchi  Human3.6M   APD         7214   HP-GAN
1 Image, 2*2 Stitchi  Human3.6M   FDE         867    HP-GAN
1 Image, 2*2 Stitchi  Human3.6M   MMADE       847    HP-GAN
1 Image, 2*2 Stitchi  Human3.6M   MMFDE       858    HP-GAN
1 Image, 2*2 Stitchi  HumanEva-I  ADE@2000ms  772    HP-GAN
1 Image, 2*2 Stitchi  HumanEva-I  APD@2000ms  1139   HP-GAN
1 Image, 2*2 Stitchi  HumanEva-I  FDE@2000ms  749    HP-GAN
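
The page does not define the metrics in the table. As a rough guide, the sketch below assumes the definitions commonly used in diverse human motion prediction benchmarks: ADE as the time-averaged L2 error of the closest sample, FDE as the final-frame L2 error of the closest sample, and APD as the mean L2 distance between all pairs of samples. The function names and the flat-list pose layout are illustrative choices, the multi-modal variants (MMADE, MMFDE) are not covered, and the units of the reported values are not stated on the page.

```python
import math
from itertools import combinations


def pose_dist(p, q):
    """Euclidean distance between two flat coordinate vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def ade(samples, truth):
    """Average displacement error of the best sample (assumed definition)."""
    return min(
        sum(pose_dist(p, q) for p, q in zip(sample, truth)) / len(truth)
        for sample in samples
    )


def fde(samples, truth):
    """Final-frame displacement error of the best sample (assumed definition)."""
    return min(pose_dist(sample[-1], truth[-1]) for sample in samples)


def apd(samples):
    """Average pairwise distance between whole predicted sequences."""
    flat = [[c for pose in sample for c in pose] for sample in samples]
    pairs = list(combinations(flat, 2))
    if not pairs:
        return 0.0  # a single sample has no diversity to measure
    return sum(pose_dist(a, b) for a, b in pairs) / len(pairs)


# Usage: two predicted 2-frame sequences of 2-D "poses" against one ground truth.
truth = [[0.0, 0.0], [0.0, 0.0]]
samples = [
    [[3.0, 4.0], [3.0, 4.0]],
    [[0.0, 0.0], [0.0, 1.0]],
]
ade_value = ade(samples, truth)
fde_value = fde(samples, truth)
apd_value = apd(samples)
```

Lower ADE and FDE indicate that at least one sample lies close to the ground truth, while higher APD indicates more diverse samples.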

Related Papers

Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction (2025-07-21)
DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model (2025-07-17)
Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios (2025-07-16)
Generative Click-through Rate Prediction with Applications to Search Advertising (2025-07-15)
Lightweight Safety Guardrails via Synthetic Data and RL-guided Adversarial Training (2025-07-11)
Conformation-Aware Structure Prediction of Antigen-Recognizing Immune Proteins (2025-07-11)
HiM2SAM: Enhancing SAM2 with Hierarchical Motion Estimation and Memory Optimization towards Long-term Tracking (2025-07-10)
Foundation models for time series forecasting: Application in conformal prediction (2025-07-09)