Chenxin Xu, Robby T. Tan, Yuhong Tan, Siheng Chen, Yu Guang Wang, Xinchao Wang, Yanfeng Wang
Learning to predict agent motions with relationship reasoning is important for many applications. In motion prediction tasks, maintaining motion equivariance under Euclidean geometric transformations and invariance of agent interaction is a critical and fundamental principle. However, such equivariance and invariance properties are overlooked by most existing methods. To fill this gap, we propose EqMotion, an efficient equivariant motion prediction model with invariant interaction reasoning. To achieve motion equivariance, we propose an equivariant geometric feature learning module to learn a Euclidean transformable feature through dedicated designs of equivariant operations. To reason agent's interactions, we propose an invariant interaction reasoning module to achieve a more stable interaction modeling. To further promote more comprehensive motion features, we propose an invariant pattern feature learning module to learn an invariant pattern feature, which cooperates with the equivariant geometric feature to enhance network expressiveness. We conduct experiments for the proposed model on four distinct scenarios: particle dynamics, molecule dynamics, human skeleton motion prediction and pedestrian trajectory prediction. Experimental results show that our method is not only generally applicable, but also achieves state-of-the-art prediction performances on all the four tasks, improving by 24.0/30.1/8.6/9.2%. Code is available at https://github.com/MediaBrain-SJTU/EqMotion.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Trajectory Prediction | ETH/UCY | ADE-8/12 | 0.21 | EqMotion |
| Pose Estimation | HARPER | Average MPJPE (mm) @ 1000ms | 104 | EqMotion |
| Pose Estimation | HARPER | Average MPJPE (mm) @ 400ms | 41 | EqMotion |
| Pose Estimation | HARPER | Last Frame MPJPE (mm) @ 1000ms | 197 | EqMotion |
| Pose Estimation | HARPER | Last Frame MPJPE (mm) @ 400ms | 69 | EqMotion |
| Pose Estimation | Human3.6M | Average MPJPE (mm) @ 1000 ms | 106.9 | EqMotion |
| Pose Estimation | Human3.6M | Average MPJPE (mm) @ 400ms | 55 | EqMotion |
| 3D | HARPER | Average MPJPE (mm) @ 1000ms | 104 | EqMotion |
| 3D | HARPER | Average MPJPE (mm) @ 400ms | 41 | EqMotion |
| 3D | HARPER | Last Frame MPJPE (mm) @ 1000ms | 197 | EqMotion |
| 3D | HARPER | Last Frame MPJPE (mm) @ 400ms | 69 | EqMotion |
| 3D | Human3.6M | Average MPJPE (mm) @ 1000 ms | 106.9 | EqMotion |
| 3D | Human3.6M | Average MPJPE (mm) @ 400ms | 55 | EqMotion |
| 1 Image, 2*2 Stitchi | HARPER | Average MPJPE (mm) @ 1000ms | 104 | EqMotion |
| 1 Image, 2*2 Stitchi | HARPER | Average MPJPE (mm) @ 400ms | 41 | EqMotion |
| 1 Image, 2*2 Stitchi | HARPER | Last Frame MPJPE (mm) @ 1000ms | 197 | EqMotion |
| 1 Image, 2*2 Stitchi | HARPER | Last Frame MPJPE (mm) @ 400ms | 69 | EqMotion |
| 1 Image, 2*2 Stitchi | Human3.6M | Average MPJPE (mm) @ 1000 ms | 106.9 | EqMotion |
| 1 Image, 2*2 Stitchi | Human3.6M | Average MPJPE (mm) @ 400ms | 55 | EqMotion |