Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Diffusion Motion: Generate Text-Guided 3D Human Motion by Diffusion Model

Zhiyuan Ren, Zhihong Pan, Xin Zhou, Le Kang

2022-10-22. Tasks: Denoising, Image Generation, Motion Synthesis

Abstract

We propose a simple and novel method for generating 3D human motion from complex natural language sentences that describe the velocity, direction, and composition of a wide range of actions. Unlike existing methods, which use classical generative architectures, we apply the Denoising Diffusion Probabilistic Model to this task, synthesizing diverse motions under text guidance. The diffusion model converts white noise into structured 3D motion through a Markov process with a series of denoising steps, and it is trained efficiently by optimizing a variational lower bound. To achieve text-conditioned motion synthesis, we use the classifier-free guidance strategy to fuse the text embedding into the model during training. Our experiments demonstrate that our model achieves competitive quantitative results on the HumanML3D test set and can generate more visually natural and diverse examples. We also show experimentally that our model is capable of zero-shot generation of motions for unseen text guidance.
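The abstract names three ingredients: a Markov noising process, a denoising model, and classifier-free guidance. The following is a minimal NumPy sketch of the surrounding arithmetic only, not the authors' implementation. The linear beta schedule, the function names, the null embedding, the dropout probability, and the guidance scale are all assumptions made for illustration; the denoising network itself is omitted.

```python
import numpy as np

def linear_alpha_bar(num_steps, beta_start=1e-4, beta_end=0.02):
    """Cumulative products of (1 - beta_t) for an assumed linear beta schedule."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    return np.cumprod(1.0 - betas)

def q_sample(x0, t, alpha_bar, noise):
    """Closed-form forward step: noise a clean motion x0 to timestep t."""
    a = alpha_bar[t]
    return np.sqrt(a) * x0 + np.sqrt(1.0 - a) * noise

def drop_condition(text_emb, null_emb, p_uncond, rng):
    """Training-time conditioning dropout (hypothetical helper).

    Each sample's text embedding is replaced by a null embedding with
    probability p_uncond, so one model learns both the conditional and
    the unconditional noise prediction.
    """
    mask = rng.random(text_emb.shape[0]) < p_uncond
    out = text_emb.copy()
    out[mask] = null_emb
    return out, mask

def guided_noise(eps_cond, eps_uncond, scale):
    """Sampling-time combination of the two noise predictions.

    scale = 0 gives the unconditional prediction, scale = 1 the
    conditional one, and scale > 1 extrapolates toward the text.
    """
    return eps_uncond + scale * (eps_cond - eps_uncond)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    alpha_bar = linear_alpha_bar(1000)
    x0 = rng.standard_normal((2, 8, 6))   # (batch, frames, features), arbitrary shape
    xt = q_sample(x0, 500, alpha_bar, rng.standard_normal(x0.shape))
    print(xt.shape)
```

In a full sampler, `guided_noise` would be applied at every denoising step to the model's two predictions for the same noisy input, one computed with the text embedding and one with the null embedding.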

Results

Task                      Dataset    Metric            Value   Model
Pose Tracking             HumanML3D  Diversity         23.692  Diffusion Motion
Pose Tracking             HumanML3D  FID               10.21   Diffusion Motion
Pose Tracking             HumanML3D  R Precision Top3  0.735   Diffusion Motion
Motion Synthesis          HumanML3D  Diversity         23.692  Diffusion Motion
Motion Synthesis          HumanML3D  FID               10.21   Diffusion Motion
Motion Synthesis          HumanML3D  R Precision Top3  0.735   Diffusion Motion
10-shot image generation  HumanML3D  Diversity         23.692  Diffusion Motion
10-shot image generation  HumanML3D  FID               10.21   Diffusion Motion
10-shot image generation  HumanML3D  R Precision Top3  0.735   Diffusion Motion
3D Human Pose Tracking    HumanML3D  Diversity         23.692  Diffusion Motion
3D Human Pose Tracking    HumanML3D  FID               10.21   Diffusion Motion
3D Human Pose Tracking    HumanML3D  R Precision Top3  0.735   Diffusion Motion
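For readers unfamiliar with the three metrics in the table, the sketch below shows how each is commonly computed from feature vectors. It assumes the motion and text features have already been extracted by some pretrained evaluator, which this page does not specify. The function names and the toy inputs are hypothetical, and details of the benchmark's evaluation protocol, such as how candidate pools are batched, are not reproduced here.

```python
import numpy as np

def fid(feats_real, feats_gen):
    """Frechet distance between Gaussians fitted to two feature sets.

    Tr(sqrt(C_r C_g)) is obtained from the eigenvalues of C_r @ C_g,
    which are real and non-negative for covariance matrices up to
    floating-point error (hence the real part and the clip).
    """
    mu_r, mu_g = feats_real.mean(axis=0), feats_gen.mean(axis=0)
    cov_r = np.cov(feats_real, rowvar=False)
    cov_g = np.cov(feats_gen, rowvar=False)
    eigvals = np.linalg.eigvals(cov_r @ cov_g)
    tr_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()
    return float(np.sum((mu_r - mu_g) ** 2)
                 + np.trace(cov_r) + np.trace(cov_g) - 2.0 * tr_sqrt)

def diversity(feats, num_pairs, rng):
    """Mean Euclidean distance between randomly paired generated features."""
    i = rng.integers(0, len(feats), num_pairs)
    j = rng.integers(0, len(feats), num_pairs)
    return float(np.linalg.norm(feats[i] - feats[j], axis=1).mean())

def r_precision(text_emb, motion_emb, k=3):
    """Fraction of texts whose matching motion (same row index) is among
    the k nearest motions by Euclidean distance."""
    d = np.linalg.norm(text_emb[:, None, :] - motion_emb[None, :, :], axis=-1)
    topk = np.argsort(d, axis=1)[:, :k]
    hits = (topk == np.arange(len(text_emb))[:, None]).any(axis=1)
    return float(hits.mean())

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    real = rng.standard_normal((500, 4))        # toy features, not real data
    gen = real + np.array([3.0, 0.0, 0.0, 0.0])
    print(fid(real, gen), diversity(gen, 300, rng), r_precision(real[:32], gen[:32]))
```

Lower FID indicates generated features closer to the real distribution, while higher R Precision indicates better agreement between text and motion. Diversity is usually read relative to the value measured on real motions.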

Related Papers

fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting (2025-07-17)
Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models (2025-07-17)
Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection (2025-07-17)
FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization (2025-07-17)
A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing constraints (2025-07-17)
Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images (2025-07-17)
Similarity-Guided Diffusion for Contrastive Sequential Recommendation (2025-07-16)
FADE: Adversarial Concept Erasure in Flow Models (2025-07-16)