TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Free-T2M: Frequency Enhanced Text-to-Motion Diffusion Mode...

Free-T2M: Frequency Enhanced Text-to-Motion Diffusion Model With Consistency Loss

Wenshuo Chen, Haozhe Jia, Songning Lai, Keming Wu, Hongru Xiao, Lijie Hu, Yutao Yue

2025-01-30DenoisingMotion GenerationMotion Synthesis
PaperPDFCode(official)

Abstract

Rapid progress in text-to-motion generation has been largely driven by diffusion models. However, existing methods focus solely on temporal modeling, thereby overlooking frequency-domain analysis. We identify two key phases in motion denoising: the **semantic planning stage** and the **fine-grained improving stage**. To address these phases effectively, we propose **Fre**quency **e**nhanced **t**ext-**to**-**m**otion diffusion model (**Free-T2M**), incorporating stage-specific consistency losses that enhance the robustness of static features and improve fine-grained accuracy. Extensive experiments demonstrate the effectiveness of our method. Specifically, on StableMoFusion, our method reduces the FID from **0.189** to **0.051**, establishing a new SOTA performance within the diffusion architecture. These findings highlight the importance of incorporating frequency-domain insights into text-to-motion generation for more precise and robust results.

Results

TaskDatasetMetricValueModel
Pose TrackingHumanML3DDiversity9.48Free-T2M (StableMoFusion)
Pose TrackingHumanML3DFID0.051Free-T2M (StableMoFusion)
Pose TrackingHumanML3DR Precision Top30.803Free-T2M (StableMoFusion)
Pose TrackingKIT Motion-LanguageDiversity10.902Free-T2M (StableMoFusion)
Pose TrackingKIT Motion-LanguageFID0.155Free-T2M (StableMoFusion)
Pose TrackingKIT Motion-LanguageR Precision Top30.789Free-T2M (StableMoFusion)
Motion SynthesisHumanML3DDiversity9.48Free-T2M (StableMoFusion)
Motion SynthesisHumanML3DFID0.051Free-T2M (StableMoFusion)
Motion SynthesisHumanML3DR Precision Top30.803Free-T2M (StableMoFusion)
Motion SynthesisKIT Motion-LanguageDiversity10.902Free-T2M (StableMoFusion)
Motion SynthesisKIT Motion-LanguageFID0.155Free-T2M (StableMoFusion)
Motion SynthesisKIT Motion-LanguageR Precision Top30.789Free-T2M (StableMoFusion)
10-shot image generationHumanML3DDiversity9.48Free-T2M (StableMoFusion)
10-shot image generationHumanML3DFID0.051Free-T2M (StableMoFusion)
10-shot image generationHumanML3DR Precision Top30.803Free-T2M (StableMoFusion)
10-shot image generationKIT Motion-LanguageDiversity10.902Free-T2M (StableMoFusion)
10-shot image generationKIT Motion-LanguageFID0.155Free-T2M (StableMoFusion)
10-shot image generationKIT Motion-LanguageR Precision Top30.789Free-T2M (StableMoFusion)
3D Human Pose TrackingHumanML3DDiversity9.48Free-T2M (StableMoFusion)
3D Human Pose TrackingHumanML3DFID0.051Free-T2M (StableMoFusion)
3D Human Pose TrackingHumanML3DR Precision Top30.803Free-T2M (StableMoFusion)
3D Human Pose TrackingKIT Motion-LanguageDiversity10.902Free-T2M (StableMoFusion)
3D Human Pose TrackingKIT Motion-LanguageFID0.155Free-T2M (StableMoFusion)
3D Human Pose TrackingKIT Motion-LanguageR Precision Top30.789Free-T2M (StableMoFusion)

Related Papers

fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16HUG-VAS: A Hierarchical NURBS-Based Generative Model for Aortic Geometry Synthesis and Controllable Editing2025-07-15AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air2025-07-15SnapMoGen: Human Motion Generation from Expressive Texts2025-07-12A statistical physics framework for optimal learning2025-07-10Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data2025-07-09