TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Diverse Human Motion Prediction via Gumbel-Softmax Samplin...

Diverse Human Motion Prediction via Gumbel-Softmax Sampling from an Auxiliary Space

Lingwei Dang, Yongwei Nie, Chengjiang Long, Qing Zhang, Guiqing Li

2022-07-15Human Pose ForecastingHuman motion predictionmotion prediction
PaperPDFCode(official)Code(official)

Abstract

Diverse human motion prediction aims at predicting multiple possible future pose sequences from a sequence of observed poses. Previous approaches usually employ deep generative networks to model the conditional distribution of data, and then randomly sample outcomes from the distribution. While different results can be obtained, they are usually the most likely ones which are not diverse enough. Recent work explicitly learns multiple modes of the conditional distribution via a deterministic network, which however can only cover a fixed number of modes within a limited range. In this paper, we propose a novel sampling strategy for sampling very diverse results from an imbalanced multimodal distribution learned by a deep generative model. Our method works by generating an auxiliary space and smartly making randomly sampling from the auxiliary space equivalent to the diverse sampling from the target distribution. We propose a simple yet effective network architecture that implements this novel sampling strategy, which incorporates a Gumbel-Softmax coefficient matrix sampling method and an aggressive diversity promoting hinge loss function. Extensive experiments demonstrate that our method significantly improves both the diversity and accuracy of the samplings compared with previous state-of-the-art sampling approaches. Code and pre-trained models are available at https://github.com/Droliven/diverse_sampling.

Results

TaskDatasetMetricValueModel
Pose EstimationHuman3.6MADE370DiverseSampling
Pose EstimationHuman3.6MAPD15310DiverseSampling
Pose EstimationHuman3.6MCMD11.692DiverseSampling
Pose EstimationHuman3.6MFDE485DiverseSampling
Pose EstimationHuman3.6MFID2.083DiverseSampling
Pose EstimationHuman3.6MMMADE475DiverseSampling
Pose EstimationHuman3.6MMMFDE516DiverseSampling
Pose EstimationHumanEva-IADE@2000ms220DHMP
Pose EstimationHumanEva-IAPD@2000ms6109DHMP
Pose EstimationHumanEva-IFDE@2000ms234DHMP
Pose EstimationHumanEva-IMMADE@2000ms342DHMP
Pose EstimationHumanEva-IMMFDE@2000ms316DHMP
3DHuman3.6MADE370DiverseSampling
3DHuman3.6MAPD15310DiverseSampling
3DHuman3.6MCMD11.692DiverseSampling
3DHuman3.6MFDE485DiverseSampling
3DHuman3.6MFID2.083DiverseSampling
3DHuman3.6MMMADE475DiverseSampling
3DHuman3.6MMMFDE516DiverseSampling
3DHumanEva-IADE@2000ms220DHMP
3DHumanEva-IAPD@2000ms6109DHMP
3DHumanEva-IFDE@2000ms234DHMP
3DHumanEva-IMMADE@2000ms342DHMP
3DHumanEva-IMMFDE@2000ms316DHMP
1 Image, 2*2 StitchiHuman3.6MADE370DiverseSampling
1 Image, 2*2 StitchiHuman3.6MAPD15310DiverseSampling
1 Image, 2*2 StitchiHuman3.6MCMD11.692DiverseSampling
1 Image, 2*2 StitchiHuman3.6MFDE485DiverseSampling
1 Image, 2*2 StitchiHuman3.6MFID2.083DiverseSampling
1 Image, 2*2 StitchiHuman3.6MMMADE475DiverseSampling
1 Image, 2*2 StitchiHuman3.6MMMFDE516DiverseSampling
1 Image, 2*2 StitchiHumanEva-IADE@2000ms220DHMP
1 Image, 2*2 StitchiHumanEva-IAPD@2000ms6109DHMP
1 Image, 2*2 StitchiHumanEva-IFDE@2000ms234DHMP
1 Image, 2*2 StitchiHumanEva-IMMADE@2000ms342DHMP
1 Image, 2*2 StitchiHumanEva-IMMFDE@2000ms316DHMP

Related Papers

Stochastic Human Motion Prediction with Memory of Action Transition and Action Characteristic2025-07-05Temporal Continual Learning with Prior Compensation for Human Motion Prediction2025-07-05AMPLIFY: Actionless Motion Priors for Robot Learning from Videos2025-06-17FocalAD: Local Motion Planning for End-to-End Autonomous Driving2025-06-13TrajFlow: Multi-modal Motion Prediction via Flow Matching2025-06-10HUMOF: Human Motion Forecasting in Interactive Social Scenes2025-06-04Rodrigues Network for Learning Robot Actions2025-06-03Autoregression-free video prediction using diffusion model for mitigating error propagation2025-05-28