Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Human Motion Diffusion Model

Guy Tevet, Sigal Raab, Brian Gordon, Yonatan Shafir, Daniel Cohen-Or, Amit H. Bermano

2022-09-29 · 3D Generation · Motion Generation · Motion Synthesis

Abstract

Natural and expressive human motion generation is the holy grail of computer animation. It is a challenging task, due to the diversity of possible motion, human perceptual sensitivity to it, and the difficulty of accurately describing it. Therefore, current generative solutions are either low-quality or limited in expressiveness. Diffusion models, which have already shown remarkable generative capabilities in other domains, are promising candidates for human motion due to their many-to-many nature, but they tend to be resource-hungry and hard to control. In this paper, we introduce Motion Diffusion Model (MDM), a carefully adapted classifier-free diffusion-based generative model for the human motion domain. MDM is transformer-based, combining insights from motion generation literature. A notable design choice is the prediction of the sample, rather than the noise, in each diffusion step. This facilitates the use of established geometric losses on the locations and velocities of the motion, such as the foot contact loss. As we demonstrate, MDM is a generic approach, enabling different modes of conditioning and different generation tasks. We show that our model is trained with lightweight resources and yet achieves state-of-the-art results on leading benchmarks for text-to-motion and action-to-motion. Project page: https://guytevet.github.io/mdm-page/
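The abstract's central design choice, predicting the clean sample instead of the noise, is what lets a loss be applied directly to motion quantities such as velocities. The following is a minimal sketch of that idea in plain Python, not the authors' implementation. It assumes a motion is a list of frames (each a list of floats), uses the standard forward-noising formula and the standard classifier-free guidance combination, and approximates a velocity loss by frame-to-frame differences. The names `q_sample`, `sample_prediction_loss`, `guided_prediction`, and the weight `lambda_vel` are hypothetical; the foot contact loss mentioned in the abstract is omitted.

```python
import math
import random


def q_sample(x0, alpha_bar, rng):
    """Forward diffusion: x_t = sqrt(alpha_bar) * x0 + sqrt(1 - alpha_bar) * eps."""
    a, b = math.sqrt(alpha_bar), math.sqrt(1.0 - alpha_bar)
    return [[a * v + b * rng.gauss(0.0, 1.0) for v in frame] for frame in x0]


def mse(a, b):
    """Mean squared error over two equally shaped frame sequences."""
    diffs = [(u - v) ** 2 for fa, fb in zip(a, b) for u, v in zip(fa, fb)]
    return sum(diffs) / len(diffs)


def velocities(x):
    """Frame-to-frame differences, a simple stand-in for joint velocities."""
    return [[n - p for p, n in zip(prev, nxt)] for prev, nxt in zip(x, x[1:])]


def sample_prediction_loss(x0, x0_hat, lambda_vel=1.0):
    """Loss on the predicted clean motion plus a velocity term.

    Because the network output x0_hat is a motion (not noise), the
    velocity term can be computed on it directly. lambda_vel is an
    assumed weighting, not a value taken from the paper.
    """
    return mse(x0, x0_hat) + lambda_vel * mse(velocities(x0), velocities(x0_hat))


def guided_prediction(pred_uncond, pred_cond, scale):
    """Classifier-free guidance: move from the unconditional prediction
    toward the conditional one by a factor of `scale`."""
    return [
        [u + scale * (c - u) for u, c in zip(fu, fc)]
        for fu, fc in zip(pred_uncond, pred_cond)
    ]


if __name__ == "__main__":
    rng = random.Random(0)
    # Toy two-channel "motion" of eight frames.
    x0 = [[math.sin(0.1 * t), math.cos(0.1 * t)] for t in range(8)]
    x_t = q_sample(x0, alpha_bar=0.5, rng=rng)
    # A real model would map (x_t, t, condition) to x0_hat; here the noisy
    # input is used as a deliberately poor prediction.
    print("loss of perfect prediction:", sample_prediction_loss(x0, x0))
    print("loss of noisy prediction:  ", sample_prediction_loss(x0, x_t))
```

With `scale` set to 1 the guided prediction equals the conditional one; larger values push further away from the unconditional prediction, trading diversity for adherence to the condition.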

Results

Task              Dataset                            Metric                Value   Model
Motion Synthesis  HumanML3D                          Diversity             9.559   MDM
Motion Synthesis  HumanML3D                          FID                   0.544   MDM
Motion Synthesis  HumanML3D                          Multimodality         2.799   MDM
Motion Synthesis  HumanML3D                          R Precision Top3      0.611   MDM
Motion Synthesis  Inter-X                            FID                   23.701  MDM
Motion Synthesis  Inter-X                            MMDist                9.548   MDM
Motion Synthesis  Inter-X                            MModality             3.49    MDM
Motion Synthesis  Inter-X                            R-Precision Top3      0.426   MDM
Motion Synthesis  InterHuman                         FID                   9.167   MDM
Motion Synthesis  InterHuman                         MMDist                7.125   MDM
Motion Synthesis  InterHuman                         MModality             2.35    MDM
Motion Synthesis  InterHuman                         R-Precision Top3      0.339   MDM
Motion Synthesis  Motion-X                           Diversity             11.4    MDM
Motion Synthesis  Motion-X                           FID                   3.8     MDM
Motion Synthesis  Motion-X                           MModality             2.53    MDM
Motion Synthesis  Motion-X                           TMR-Matching Score    0.84    MDM
Motion Synthesis  Motion-X                           TMR-R-Precision Top3  0.6341  MDM
Motion Synthesis  HumanAct12                         Accuracy              0.99    MDM
Motion Synthesis  HumanAct12                         FID                   0.08    MDM
Motion Synthesis  HumanAct12                         Multimodality         2.58    MDM
Motion Synthesis  KIT Motion-Language                Diversity             10.847  MDM
Motion Synthesis  KIT Motion-Language                FID                   0.497   MDM
Motion Synthesis  KIT Motion-Language                Multimodality         1.907   MDM
Motion Synthesis  KIT Motion-Language                R Precision Top3      0.396   MDM
3D Generation     E.T. the Exceptional Trajectories  ClaTr-Score           18.32   MDM
3D Generation     E.T. the Exceptional Trajectories  Classifier-F1         0.34    MDM
3D Generation     E.T. the Exceptional Trajectories  FD_ClaTr              6.79    MDM

The site lists the Motion Synthesis rows again, with identical values, under the tasks Pose Tracking, 3D Human Pose Tracking, and 10-shot image generation, and the 3D Generation rows again under Image Generation.

Related Papers

AutoPartGen: Autogressive 3D Part Generation and Discovery (2025-07-17)
PhysX: Physical-Grounded 3D Asset Generation (2025-07-16)
SnapMoGen: Human Motion Generation from Expressive Texts (2025-07-12)
Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data (2025-07-09)
DreamArt: Generating Interactable Articulated Objects from a Single Image (2025-07-08)
OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion (2025-07-08)
Acquiring and Adapting Priors for Novel Tasks via Neural Meta-Architectures (2025-07-07)
Motion Generation: A Survey of Generative Approaches and Benchmarks (2025-07-07)