TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/TransFusion: A Practical and Effective Transformer-based D...

TransFusion: A Practical and Effective Transformer-based Diffusion Model for 3D Human Motion Prediction

Sibo Tian, Minghui Zheng, Xiao Liang

2023-07-30Human Pose ForecastingHuman motion predictionmotion prediction
PaperPDFCode(official)

Abstract

Predicting human motion plays a crucial role in ensuring a safe and effective human-robot close collaboration in intelligent remanufacturing systems of the future. Existing works can be categorized into two groups: those focusing on accuracy, predicting a single future motion, and those generating diverse predictions based on observations. The former group fails to address the uncertainty and multi-modal nature of human motion, while the latter group often produces motion sequences that deviate too far from the ground truth or become unrealistic within historical contexts. To tackle these issues, we propose TransFusion, an innovative and practical diffusion-based model for 3D human motion prediction which can generate samples that are more likely to happen while maintaining a certain level of diversity. Our model leverages Transformer as the backbone with long skip connections between shallow and deep layers. Additionally, we employ the discrete cosine transform to model motion sequences in the frequency space, thereby improving performance. In contrast to prior diffusion-based models that utilize extra modules like cross-attention and adaptive layer normalization to condition the prediction on past observed motion, we treat all inputs, including conditions, as tokens to create a more lightweight model compared to existing approaches. Extensive experimental studies are conducted on benchmark datasets to validate the effectiveness of our human motion prediction model.

Results

TaskDatasetMetricValueModel
Pose EstimationAMASSADE0.508TransFusion
Pose EstimationAMASSAPD8.853TransFusion
Pose EstimationAMASSFDE0.568TransFusion
Pose EstimationHuman3.6MADE358TransFusion
Pose EstimationHuman3.6MAPD5975TransFusion
Pose EstimationHuman3.6MFDE468TransFusion
Pose EstimationHuman3.6MMMADE506TransFusion
Pose EstimationHuman3.6MMMFDE539TransFusion
Pose EstimationHumanEva-IADE@2000ms204TransFusion
Pose EstimationHumanEva-IAPD@2000ms1031TransFusion
Pose EstimationHumanEva-IFDE@2000ms234TransFusion
Pose EstimationHumanEva-IMMADE@2000ms408TransFusion
Pose EstimationHumanEva-IMMFDE@2000ms427TransFusion
3DAMASSADE0.508TransFusion
3DAMASSAPD8.853TransFusion
3DAMASSFDE0.568TransFusion
3DHuman3.6MADE358TransFusion
3DHuman3.6MAPD5975TransFusion
3DHuman3.6MFDE468TransFusion
3DHuman3.6MMMADE506TransFusion
3DHuman3.6MMMFDE539TransFusion
3DHumanEva-IADE@2000ms204TransFusion
3DHumanEva-IAPD@2000ms1031TransFusion
3DHumanEva-IFDE@2000ms234TransFusion
3DHumanEva-IMMADE@2000ms408TransFusion
3DHumanEva-IMMFDE@2000ms427TransFusion
1 Image, 2*2 StitchiAMASSADE0.508TransFusion
1 Image, 2*2 StitchiAMASSAPD8.853TransFusion
1 Image, 2*2 StitchiAMASSFDE0.568TransFusion
1 Image, 2*2 StitchiHuman3.6MADE358TransFusion
1 Image, 2*2 StitchiHuman3.6MAPD5975TransFusion
1 Image, 2*2 StitchiHuman3.6MFDE468TransFusion
1 Image, 2*2 StitchiHuman3.6MMMADE506TransFusion
1 Image, 2*2 StitchiHuman3.6MMMFDE539TransFusion
1 Image, 2*2 StitchiHumanEva-IADE@2000ms204TransFusion
1 Image, 2*2 StitchiHumanEva-IAPD@2000ms1031TransFusion
1 Image, 2*2 StitchiHumanEva-IFDE@2000ms234TransFusion
1 Image, 2*2 StitchiHumanEva-IMMADE@2000ms408TransFusion
1 Image, 2*2 StitchiHumanEva-IMMFDE@2000ms427TransFusion

Related Papers

Stochastic Human Motion Prediction with Memory of Action Transition and Action Characteristic2025-07-05Temporal Continual Learning with Prior Compensation for Human Motion Prediction2025-07-05AMPLIFY: Actionless Motion Priors for Robot Learning from Videos2025-06-17FocalAD: Local Motion Planning for End-to-End Autonomous Driving2025-06-13TrajFlow: Multi-modal Motion Prediction via Flow Matching2025-06-10HUMOF: Human Motion Forecasting in Interactive Social Scenes2025-06-04Rodrigues Network for Learning Robot Actions2025-06-03Autoregression-free video prediction using diffusion model for mitigating error propagation2025-05-28