TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/MotionMixer: MLP-based 3D Human Body Pose Forecasting

MotionMixer: MLP-based 3D Human Body Pose Forecasting

Arij Bouazizi, Adrian Holzbock, Ulrich Kressel, Klaus Dietmayer, Vasileios Belagiannis

2022-07-01Human Pose Forecasting
PaperPDFCode(official)

Abstract

In this work, we present MotionMixer, an efficient 3D human body pose forecasting model based solely on multi-layer perceptrons (MLPs). MotionMixer learns the spatial-temporal 3D body pose dependencies by sequentially mixing both modalities. Given a stacked sequence of 3D body poses, a spatial-MLP extracts fine grained spatial dependencies of the body joints. The interaction of the body joints over time is then modelled by a temporal MLP. The spatial-temporal mixed features are finally aggregated and decoded to obtain the future motion. To calibrate the influence of each time step in the pose sequence, we make use of squeeze-and-excitation (SE) blocks. We evaluate our approach on Human3.6M, AMASS, and 3DPW datasets using the standard evaluation protocols. For all evaluations, we demonstrate state-of-the-art performance, while having a model with a smaller number of parameters. Our code is available at: https://github.com/MotionMLP/MotionMixer

Results

TaskDatasetMetricValueModel
Pose EstimationHuman3.6MAverage MPJPE (mm) @ 1000 ms111MotionMixer
Pose EstimationHuman3.6MAverage MPJPE (mm) @ 400ms59.3MotionMixer
Pose EstimationHuman3.6MMAR, walking, 1,000ms0.73MotionMixer
Pose EstimationHuman3.6MMAR, walking, 400ms0.58MotionMixer
3DHuman3.6MAverage MPJPE (mm) @ 1000 ms111MotionMixer
3DHuman3.6MAverage MPJPE (mm) @ 400ms59.3MotionMixer
3DHuman3.6MMAR, walking, 1,000ms0.73MotionMixer
3DHuman3.6MMAR, walking, 400ms0.58MotionMixer
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm) @ 1000 ms111MotionMixer
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm) @ 400ms59.3MotionMixer
1 Image, 2*2 StitchiHuman3.6MMAR, walking, 1,000ms0.73MotionMixer
1 Image, 2*2 StitchiHuman3.6MMAR, walking, 400ms0.58MotionMixer

Related Papers

MotionMap: Representing Multimodality in Human Pose Forecasting2024-12-25EgoCast: Forecasting Egocentric Human Pose in the Wild2024-12-03Multi-agent Long-term 3D Human Pose Forecasting via Interaction-aware Trajectory Conditioning2024-04-08Exploring 3D Human Pose Estimation and Forecasting from the Robot's Perspective: The HARPER Dataset2024-03-21Context-based Interpretable Spatio-Temporal Graph Convolutional Network for Human Motion Forecasting2024-02-21Expressive Forecasting of 3D Whole-body Human Motions2023-12-19GCNext: Towards the Unity of Graph Convolutions for Human Motion Prediction2023-12-19Personalized Pose Forecasting2023-12-06