TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Auxiliary Tasks Benefit 3D Skeleton-based Human Motion Pre...

Auxiliary Tasks Benefit 3D Skeleton-based Human Motion Prediction

Chenxin Xu, Robby T. Tan, Yuhong Tan, Siheng Chen, Xinchao Wang, Yanfeng Wang

2023-08-17ICCV 2023 1Human Pose ForecastingHuman motion predictionmotion prediction
PaperPDFCode(official)

Abstract

Exploring spatial-temporal dependencies from observed motions is one of the core challenges of human motion prediction. Previous methods mainly focus on dedicated network structures to model the spatial and temporal dependencies. This paper considers a new direction by introducing a model learning framework with auxiliary tasks. In our auxiliary tasks, partial body joints' coordinates are corrupted by either masking or adding noise and the goal is to recover corrupted coordinates depending on the rest coordinates. To work with auxiliary tasks, we propose a novel auxiliary-adapted transformer, which can handle incomplete, corrupted motion data and achieve coordinate recovery via capturing spatial-temporal dependencies. Through auxiliary tasks, the auxiliary-adapted transformer is promoted to capture more comprehensive spatial-temporal dependencies among body joints' coordinates, leading to better feature learning. Extensive experimental results have shown that our method outperforms state-of-the-art methods by remarkable margins of 7.2%, 3.7%, and 9.4% in terms of 3D mean per joint position error (MPJPE) on the Human3.6M, CMU Mocap, and 3DPW datasets, respectively. We also demonstrate that our method is more robust under data missing cases and noisy data cases. Code is available at https://github.com/MediaBrain-SJTU/AuxFormer.

Results

TaskDatasetMetricValueModel
Pose EstimationHuman3.6MAverage MPJPE (mm) @ 1000 ms107AuxFormer
Pose EstimationHuman3.6MAverage MPJPE (mm) @ 400ms54.1AuxFormer
Pose Estimation3DPWAverage MPJPE (mm) 1000 msec107.45AuxFormer
3DHuman3.6MAverage MPJPE (mm) @ 1000 ms107AuxFormer
3DHuman3.6MAverage MPJPE (mm) @ 400ms54.1AuxFormer
3D3DPWAverage MPJPE (mm) 1000 msec107.45AuxFormer
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm) @ 1000 ms107AuxFormer
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm) @ 400ms54.1AuxFormer
1 Image, 2*2 Stitchi3DPWAverage MPJPE (mm) 1000 msec107.45AuxFormer

Related Papers

Stochastic Human Motion Prediction with Memory of Action Transition and Action Characteristic2025-07-05Temporal Continual Learning with Prior Compensation for Human Motion Prediction2025-07-05AMPLIFY: Actionless Motion Priors for Robot Learning from Videos2025-06-17FocalAD: Local Motion Planning for End-to-End Autonomous Driving2025-06-13TrajFlow: Multi-modal Motion Prediction via Flow Matching2025-06-10HUMOF: Human Motion Forecasting in Interactive Social Scenes2025-06-04Rodrigues Network for Learning Robot Actions2025-06-03Autoregression-free video prediction using diffusion model for mitigating error propagation2025-05-28