Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Motion Mamba: Efficient and Long Sequence Motion Generation

Zeyu Zhang, Akide Liu, Ian Reid, Richard Hartley, Bohan Zhuang, Hao Tang

2024-03-12 · Motion Generation · Motion Synthesis

Abstract

Human motion generation is a significant pursuit in generative computer vision, yet achieving efficient long-sequence motion generation remains challenging. Recent advances in state space models (SSMs), notably Mamba, have shown considerable promise in long-sequence modeling with an efficient hardware-aware design, making them a promising foundation for motion generation models. Nevertheless, adapting SSMs to motion generation is difficult because no specialized architecture exists for modeling motion sequences. To address these challenges, we propose Motion Mamba, a simple and efficient approach that presents a pioneering motion generation model built on SSMs. Specifically, we design a Hierarchical Temporal Mamba (HTM) block that processes temporal data by ensembling varying numbers of isolated SSM modules across a symmetric U-Net architecture, with the aim of preserving motion consistency between frames. We also design a Bidirectional Spatial Mamba (BSM) block that processes latent poses bidirectionally to improve the accuracy of motion generation within a temporal frame. Our proposed method achieves up to a 50% FID improvement and runs up to 4 times faster on the HumanML3D and KIT-ML datasets compared to the previous best diffusion-based method, demonstrating strong capabilities in high-quality long-sequence motion modeling and real-time human motion generation. See the project website: https://steve-zeyu-zhang.github.io/MotionMamba/
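To make the idea of bidirectional sequence processing concrete, here is a minimal toy sketch. It is not the authors' BSM block and not Mamba's selective scan: it assumes a scalar, time-invariant linear recurrence (h_t = a*h_{t-1} + b*x_t, y_t = c*h_t), and the function names `ssm_scan` and `bidirectional_scan` and the parameters `a`, `b`, `c` are hypothetical, chosen only for illustration. It shows one common way to combine a forward and a backward pass so that each position sees context from both directions.

```python
from typing import List


def ssm_scan(x: List[float], a: float, b: float, c: float) -> List[float]:
    """Toy scalar state space recurrence (assumption, not Mamba itself).

    h_t = a * h_{t-1} + b * x_t
    y_t = c * h_t
    """
    h = 0.0
    ys: List[float] = []
    for x_t in x:
        h = a * h + b * x_t
        ys.append(c * h)
    return ys


def bidirectional_scan(x: List[float], a: float, b: float, c: float) -> List[float]:
    """Sum a left-to-right scan and a right-to-left scan of the same input."""
    forward = ssm_scan(x, a, b, c)
    # Scan the reversed input, then flip the outputs back into original order.
    backward = ssm_scan(x[::-1], a, b, c)[::-1]
    return [f + g for f, g in zip(forward, backward)]


if __name__ == "__main__":
    # An impulse at the first position decays forward in time; the backward
    # pass contributes only at the impulse position itself.
    print(bidirectional_scan([1.0, 0.0, 0.0], a=0.5, b=1.0, c=1.0))
```

Summing the two passes is only one possible way to merge directions; concatenation or a learned projection are equally plausible choices, and the page does not say which the paper uses.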

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Pose Tracking | HumanML3D | Diversity | 9.871 | Motion Mamba |
| Pose Tracking | HumanML3D | FID | 0.281 | Motion Mamba |
| Pose Tracking | HumanML3D | Multimodality | 2.294 | Motion Mamba |
| Pose Tracking | HumanML3D | R Precision Top3 | 0.792 | Motion Mamba |
| Pose Tracking | KIT Motion-Language | Diversity | 11.02 | Motion Mamba |
| Pose Tracking | KIT Motion-Language | FID | 0.307 | Motion Mamba |
| Pose Tracking | KIT Motion-Language | Multimodality | 1.678 | Motion Mamba |
| Pose Tracking | KIT Motion-Language | R Precision Top3 | 0.765 | Motion Mamba |
| Motion Synthesis | HumanML3D | Diversity | 9.871 | Motion Mamba |
| Motion Synthesis | HumanML3D | FID | 0.281 | Motion Mamba |
| Motion Synthesis | HumanML3D | Multimodality | 2.294 | Motion Mamba |
| Motion Synthesis | HumanML3D | R Precision Top3 | 0.792 | Motion Mamba |
| Motion Synthesis | KIT Motion-Language | Diversity | 11.02 | Motion Mamba |
| Motion Synthesis | KIT Motion-Language | FID | 0.307 | Motion Mamba |
| Motion Synthesis | KIT Motion-Language | Multimodality | 1.678 | Motion Mamba |
| Motion Synthesis | KIT Motion-Language | R Precision Top3 | 0.765 | Motion Mamba |
| 10-shot image generation | HumanML3D | Diversity | 9.871 | Motion Mamba |
| 10-shot image generation | HumanML3D | FID | 0.281 | Motion Mamba |
| 10-shot image generation | HumanML3D | Multimodality | 2.294 | Motion Mamba |
| 10-shot image generation | HumanML3D | R Precision Top3 | 0.792 | Motion Mamba |
| 10-shot image generation | KIT Motion-Language | Diversity | 11.02 | Motion Mamba |
| 10-shot image generation | KIT Motion-Language | FID | 0.307 | Motion Mamba |
| 10-shot image generation | KIT Motion-Language | Multimodality | 1.678 | Motion Mamba |
| 10-shot image generation | KIT Motion-Language | R Precision Top3 | 0.765 | Motion Mamba |
| 3D Human Pose Tracking | HumanML3D | Diversity | 9.871 | Motion Mamba |
| 3D Human Pose Tracking | HumanML3D | FID | 0.281 | Motion Mamba |
| 3D Human Pose Tracking | HumanML3D | Multimodality | 2.294 | Motion Mamba |
| 3D Human Pose Tracking | HumanML3D | R Precision Top3 | 0.792 | Motion Mamba |
| 3D Human Pose Tracking | KIT Motion-Language | Diversity | 11.02 | Motion Mamba |
| 3D Human Pose Tracking | KIT Motion-Language | FID | 0.307 | Motion Mamba |
| 3D Human Pose Tracking | KIT Motion-Language | Multimodality | 1.678 | Motion Mamba |
| 3D Human Pose Tracking | KIT Motion-Language | R Precision Top3 | 0.765 | Motion Mamba |
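For readers unfamiliar with the R Precision Top3 metric listed above, the following is a minimal sketch of how a top-k retrieval precision can be computed. It assumes a precomputed distance matrix in which `dist[i][j]` is the distance between text description `i` and motion `j`, with the correct pairing on the diagonal. The function name `r_precision_top_k`, the matrix layout, and the sample numbers are illustrative assumptions; this page does not describe the evaluation protocol (candidate pool size, feature extractor) used to obtain the reported values.

```python
from typing import Sequence


def r_precision_top_k(dist: Sequence[Sequence[float]], k: int = 3) -> float:
    """Fraction of queries whose true match ranks among the k nearest candidates.

    dist[i][j] is the distance from query i to candidate j; candidate i is
    assumed to be the correct match for query i (hypothetical layout).
    """
    hits = 0
    for i, row in enumerate(dist):
        # Rank of the true match = number of candidates strictly closer than it.
        rank = sum(1 for d in row if d < row[i])
        if rank < k:
            hits += 1
    return hits / len(dist)


if __name__ == "__main__":
    # Made-up distances for four text/motion pairs, for illustration only.
    sample = [
        [0.10, 0.50, 0.90, 0.70],
        [0.20, 0.80, 0.30, 0.40],
        [0.60, 0.50, 0.10, 0.90],
        [0.10, 0.20, 0.95, 0.90],
    ]
    print(r_precision_top_k(sample, k=3))
    print(r_precision_top_k(sample, k=1))
```

Higher values are better for this metric, whereas lower values are better for FID.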

Related Papers

SnapMoGen: Human Motion Generation from Expressive Texts (2025-07-12)
Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data (2025-07-09)
Motion Generation: A Survey of Generative Approaches and Benchmarks (2025-07-07)
DeepGesture: A conversational gesture synthesis system based on emotions and semantics (2025-07-03)
A Unified Transformer-Based Framework with Pretraining For Whole Body Grasping Motion Generation (2025-07-01)
VolumetricSMPL: A Neural Volumetric Body Model for Efficient Interactions, Contacts, and Collisions (2025-06-29)
DuetGen: Music Driven Two-Person Dance Generation via Hierarchical Masked Modeling (2025-06-23)
PlanMoGPT: Flow-Enhanced Progressive Planning for Text to Motion Synthesis (2025-06-22)