Auxiliary Tasks Benefit 3D Skeleton-based Human Motion Prediction

Chenxin Xu, Robby T. Tan, Yuhong Tan, Siheng Chen, Xinchao Wang, Yanfeng Wang

2023-08-17ICCV 2023 1Human Pose Forecasting Human motion prediction motion prediction

Abstract

Exploring spatial-temporal dependencies from observed motions is one of the core challenges of human motion prediction. Previous methods mainly focus on dedicated network structures to model the spatial and temporal dependencies. This paper considers a new direction by introducing a model learning framework with auxiliary tasks. In our auxiliary tasks, partial body joints' coordinates are corrupted by either masking or adding noise and the goal is to recover corrupted coordinates depending on the rest coordinates. To work with auxiliary tasks, we propose a novel auxiliary-adapted transformer, which can handle incomplete, corrupted motion data and achieve coordinate recovery via capturing spatial-temporal dependencies. Through auxiliary tasks, the auxiliary-adapted transformer is promoted to capture more comprehensive spatial-temporal dependencies among body joints' coordinates, leading to better feature learning. Extensive experimental results have shown that our method outperforms state-of-the-art methods by remarkable margins of 7.2%, 3.7%, and 9.4% in terms of 3D mean per joint position error (MPJPE) on the Human3.6M, CMU Mocap, and 3DPW datasets, respectively. We also demonstrate that our method is more robust under data missing cases and noisy data cases. Code is available at https://github.com/MediaBrain-SJTU/AuxFormer.

Results

Task	Dataset	Metric	Value	Model
Pose Estimation	Human3.6M	Average MPJPE (mm) @ 1000 ms	107	AuxFormer
Pose Estimation	Human3.6M	Average MPJPE (mm) @ 400ms	54.1	AuxFormer
Pose Estimation	3DPW	Average MPJPE (mm) 1000 msec	107.45	AuxFormer
3D	Human3.6M	Average MPJPE (mm) @ 1000 ms	107	AuxFormer
3D	Human3.6M	Average MPJPE (mm) @ 400ms	54.1	AuxFormer
3D	3DPW	Average MPJPE (mm) 1000 msec	107.45	AuxFormer
1 Image, 2*2 Stitchi	Human3.6M	Average MPJPE (mm) @ 1000 ms	107	AuxFormer
1 Image, 2*2 Stitchi	Human3.6M	Average MPJPE (mm) @ 400ms	54.1	AuxFormer
1 Image, 2*2 Stitchi	3DPW	Average MPJPE (mm) 1000 msec	107.45	AuxFormer

Auxiliary Tasks Benefit 3D Skeleton-based Human Motion Prediction

Abstract

Results

Related Papers

Auxiliary Tasks Benefit 3D Skeleton-based Human Motion Prediction

Abstract

Results

Related Papers