TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Lodge: A Coarse to Fine Diffusion Network for Long Dance G...

Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives

Ronghui Li, Yuxiang Zhang, Yachao Zhang, Hongwen Zhang, Jie Guo, Yan Zhang, Yebin Liu, Xiu Li

2024-03-15CVPR 2024 1Motion Synthesis
PaperPDFCode(official)

Abstract

We propose Lodge, a network capable of generating extremely long dance sequences conditioned on given music. We design Lodge as a two-stage coarse to fine diffusion architecture, and propose the characteristic dance primitives that possess significant expressiveness as intermediate representations between two diffusion models. The first stage is global diffusion, which focuses on comprehending the coarse-level music-dance correlation and production characteristic dance primitives. In contrast, the second-stage is the local diffusion, which parallelly generates detailed motion sequences under the guidance of the dance primitives and choreographic rules. In addition, we propose a Foot Refine Block to optimize the contact between the feet and the ground, enhancing the physical realism of the motion. Our approach can parallelly generate dance sequences of extremely long length, striking a balance between global choreographic patterns and local motion quality and expressiveness. Extensive experiments validate the efficacy of our method.

Results

TaskDatasetMetricValueModel
Pose TrackingFineDanceBAS0.2397Lodge (DDPM)
Pose TrackingFineDancefid_k45.56Lodge (DDPM)
Pose TrackingFineDanceBAS0.2269Lodge (DDIM)
Pose TrackingFineDancefid_k50Lodge (DDIM)
Pose TrackingAIST++Beat alignment score0.24Lodge (DDPM)
Pose TrackingAIST++FID37.09Lodge (DDPM)
Motion SynthesisFineDanceBAS0.2397Lodge (DDPM)
Motion SynthesisFineDancefid_k45.56Lodge (DDPM)
Motion SynthesisFineDanceBAS0.2269Lodge (DDIM)
Motion SynthesisFineDancefid_k50Lodge (DDIM)
Motion SynthesisAIST++Beat alignment score0.24Lodge (DDPM)
Motion SynthesisAIST++FID37.09Lodge (DDPM)
10-shot image generationFineDanceBAS0.2397Lodge (DDPM)
10-shot image generationFineDancefid_k45.56Lodge (DDPM)
10-shot image generationFineDanceBAS0.2269Lodge (DDIM)
10-shot image generationFineDancefid_k50Lodge (DDIM)
10-shot image generationAIST++Beat alignment score0.24Lodge (DDPM)
10-shot image generationAIST++FID37.09Lodge (DDPM)
3D Human Pose TrackingFineDanceBAS0.2397Lodge (DDPM)
3D Human Pose TrackingFineDancefid_k45.56Lodge (DDPM)
3D Human Pose TrackingFineDanceBAS0.2269Lodge (DDIM)
3D Human Pose TrackingFineDancefid_k50Lodge (DDIM)
3D Human Pose TrackingAIST++Beat alignment score0.24Lodge (DDPM)
3D Human Pose TrackingAIST++FID37.09Lodge (DDPM)

Related Papers

DeepGesture: A conversational gesture synthesis system based on emotions and semantics2025-07-03VolumetricSMPL: A Neural Volumetric Body Model for Efficient Interactions, Contacts, and Collisions2025-06-29DuetGen: Music Driven Two-Person Dance Generation via Hierarchical Masked Modeling2025-06-23PlanMoGPT: Flow-Enhanced Progressive Planning for Text to Motion Synthesis2025-06-22Motion-R1: Chain-of-Thought Reasoning and Reinforcement Learning for Human Motion Generation2025-06-12DanceChat: Large Language Model-Guided Music-to-Dance Generation2025-06-12MotionRAG-Diff: A Retrieval-Augmented Diffusion Framework for Long-Term Music-to-Dance Generation2025-06-03MotionPro: A Precise Motion Controller for Image-to-Video Generation2025-05-26