TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Back to MLP: A Simple Baseline for Human Motion Prediction

Back to MLP: A Simple Baseline for Human Motion Prediction

Wen Guo, Yuming Du, Xi Shen, Vincent Lepetit, Xavier Alameda-Pineda, Francesc Moreno-Noguer

2022-07-04Human Pose ForecastingHuman motion predictionmotion predictionMulti-Person Pose forecasting
PaperPDFCode(official)

Abstract

This paper tackles the problem of human motion prediction, consisting in forecasting future body poses from historically observed sequences. State-of-the-art approaches provide good results, however, they rely on deep learning architectures of arbitrary complexity, such as Recurrent Neural Networks(RNN), Transformers or Graph Convolutional Networks(GCN), typically requiring multiple training stages and more than 2 million parameters. In this paper, we show that, after combining with a series of standard practices, such as applying Discrete Cosine Transform(DCT), predicting residual displacement of joints and optimizing velocity as an auxiliary loss, a light-weight network based on multi-layer perceptrons(MLPs) with only 0.14 million parameters can surpass the state-of-the-art performance. An exhaustive evaluation on the Human3.6M, AMASS, and 3DPW datasets shows that our method, named siMLPe, consistently outperforms all other approaches. We hope that our simple method could serve as a strong baseline for the community and allow re-thinking of the human motion prediction problem. The code is publicly available at \url{https://github.com/dulucas/siMLPe}.

Results

TaskDatasetMetricValueModel
Autonomous VehiclesExpi - common actions splitAverage MPJPE (mm) @ 1000 ms250siMLPe
Autonomous VehiclesExpi - common actions splitAverage MPJPE (mm) @ 400 ms128siMLPe
Autonomous VehiclesExpi - common actions splitAverage MPJPE (mm) @ 600 ms178siMLPe
Autonomous VehiclesExpi - unseen actions splitAverage MPJPE (mm) @ 400 ms131siMLPe
Autonomous VehiclesExpi - unseen actions splitAverage MPJPE (mm) @ 600 ms183siMLPe
Autonomous VehiclesExpi - unseen actions splitAverage MPJPE (mm) @ 800 ms225siMLPe
Pose EstimationHARPERAverage MPJPE (mm) @ 1000ms141SiMLPe
Pose EstimationHARPERAverage MPJPE (mm) @ 400ms60SiMLPe
Pose EstimationHARPERLast Frame MPJPE (mm) @ 1000ms264SiMLPe
Pose EstimationHARPERLast Frame MPJPE (mm) @ 400ms98SiMLPe
Pose EstimationAMASSAverage MPJPE (mm) 1000 msec65.7siMLPe
Pose EstimationHuman3.6MAverage MPJPE (mm) @ 1000 ms109.4siMLPe
Pose EstimationHuman3.6MAverage MPJPE (mm) @ 400ms57.3siMLPe
Pose Estimation3DPWAverage MPJPE (mm) 1000 msec72.2siMLPe
Pose EstimationExpi - common actions splitAverage MPJPE (mm) @ 200 ms80siMLPe
Motion ForecastingExpi - common actions splitAverage MPJPE (mm) @ 1000 ms250siMLPe
Motion ForecastingExpi - common actions splitAverage MPJPE (mm) @ 400 ms128siMLPe
Motion ForecastingExpi - common actions splitAverage MPJPE (mm) @ 600 ms178siMLPe
Motion ForecastingExpi - unseen actions splitAverage MPJPE (mm) @ 400 ms131siMLPe
Motion ForecastingExpi - unseen actions splitAverage MPJPE (mm) @ 600 ms183siMLPe
Motion ForecastingExpi - unseen actions splitAverage MPJPE (mm) @ 800 ms225siMLPe
3DHARPERAverage MPJPE (mm) @ 1000ms141SiMLPe
3DHARPERAverage MPJPE (mm) @ 400ms60SiMLPe
3DHARPERLast Frame MPJPE (mm) @ 1000ms264SiMLPe
3DHARPERLast Frame MPJPE (mm) @ 400ms98SiMLPe
3DAMASSAverage MPJPE (mm) 1000 msec65.7siMLPe
3DHuman3.6MAverage MPJPE (mm) @ 1000 ms109.4siMLPe
3DHuman3.6MAverage MPJPE (mm) @ 400ms57.3siMLPe
3D3DPWAverage MPJPE (mm) 1000 msec72.2siMLPe
3DExpi - common actions splitAverage MPJPE (mm) @ 200 ms80siMLPe
Autonomous DrivingExpi - common actions splitAverage MPJPE (mm) @ 1000 ms250siMLPe
Autonomous DrivingExpi - common actions splitAverage MPJPE (mm) @ 400 ms128siMLPe
Autonomous DrivingExpi - common actions splitAverage MPJPE (mm) @ 600 ms178siMLPe
Autonomous DrivingExpi - unseen actions splitAverage MPJPE (mm) @ 400 ms131siMLPe
Autonomous DrivingExpi - unseen actions splitAverage MPJPE (mm) @ 600 ms183siMLPe
Autonomous DrivingExpi - unseen actions splitAverage MPJPE (mm) @ 800 ms225siMLPe
1 Image, 2*2 StitchiHARPERAverage MPJPE (mm) @ 1000ms141SiMLPe
1 Image, 2*2 StitchiHARPERAverage MPJPE (mm) @ 400ms60SiMLPe
1 Image, 2*2 StitchiHARPERLast Frame MPJPE (mm) @ 1000ms264SiMLPe
1 Image, 2*2 StitchiHARPERLast Frame MPJPE (mm) @ 400ms98SiMLPe
1 Image, 2*2 StitchiAMASSAverage MPJPE (mm) 1000 msec65.7siMLPe
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm) @ 1000 ms109.4siMLPe
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm) @ 400ms57.3siMLPe
1 Image, 2*2 Stitchi3DPWAverage MPJPE (mm) 1000 msec72.2siMLPe
1 Image, 2*2 StitchiExpi - common actions splitAverage MPJPE (mm) @ 200 ms80siMLPe

Related Papers

Stochastic Human Motion Prediction with Memory of Action Transition and Action Characteristic2025-07-05Temporal Continual Learning with Prior Compensation for Human Motion Prediction2025-07-05AMPLIFY: Actionless Motion Priors for Robot Learning from Videos2025-06-17FocalAD: Local Motion Planning for End-to-End Autonomous Driving2025-06-13TrajFlow: Multi-modal Motion Prediction via Flow Matching2025-06-10HUMOF: Human Motion Forecasting in Interactive Social Scenes2025-06-04Rodrigues Network for Learning Robot Actions2025-06-03Autoregression-free video prediction using diffusion model for mitigating error propagation2025-05-28