TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Bailando: 3D Dance Generation by Actor-Critic GPT with Cho...

Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory

Li SiYao, Weijiang Yu, Tianpei Gu, Chunze Lin, Quan Wang, Chen Qian, Chen Change Loy, Ziwei Liu

2022-03-24CVPR 2022 1Motion Synthesis
PaperPDFCode(official)

Abstract

Driving 3D characters to dance following a piece of music is highly challenging due to the spatial constraints applied to poses by choreography norms. In addition, the generated dance sequence also needs to maintain temporal coherency with different music genres. To tackle these challenges, we propose a novel music-to-dance framework, Bailando, with two powerful components: 1) a choreographic memory that learns to summarize meaningful dancing units from 3D pose sequence to a quantized codebook, 2) an actor-critic Generative Pre-trained Transformer (GPT) that composes these units to a fluent dance coherent to the music. With the learned choreographic memory, dance generation is realized on the quantized units that meet high choreography standards, such that the generated dancing sequences are confined within the spatial constraints. To achieve synchronized alignment between diverse motion tempos and music beats, we introduce an actor-critic-based reinforcement learning scheme to the GPT with a newly-designed beat-align reward function. Extensive experiments on the standard benchmark demonstrate that our proposed framework achieves state-of-the-art performance both qualitatively and quantitatively. Notably, the learned choreographic memory is shown to discover human-interpretable dancing-style poses in an unsupervised manner.

Results

TaskDatasetMetricValueModel
Pose TrackingFineDanceBAS0.2029Bailando
Pose TrackingFineDancefid_k82.81Bailando
Pose TrackingAIST++Beat alignment score0.233Bailando
Pose TrackingAIST++FID28.16Bailando
Motion SynthesisFineDanceBAS0.2029Bailando
Motion SynthesisFineDancefid_k82.81Bailando
Motion SynthesisAIST++Beat alignment score0.233Bailando
Motion SynthesisAIST++FID28.16Bailando
10-shot image generationFineDanceBAS0.2029Bailando
10-shot image generationFineDancefid_k82.81Bailando
10-shot image generationAIST++Beat alignment score0.233Bailando
10-shot image generationAIST++FID28.16Bailando
3D Human Pose TrackingFineDanceBAS0.2029Bailando
3D Human Pose TrackingFineDancefid_k82.81Bailando
3D Human Pose TrackingAIST++Beat alignment score0.233Bailando
3D Human Pose TrackingAIST++FID28.16Bailando

Related Papers

DeepGesture: A conversational gesture synthesis system based on emotions and semantics2025-07-03VolumetricSMPL: A Neural Volumetric Body Model for Efficient Interactions, Contacts, and Collisions2025-06-29DuetGen: Music Driven Two-Person Dance Generation via Hierarchical Masked Modeling2025-06-23PlanMoGPT: Flow-Enhanced Progressive Planning for Text to Motion Synthesis2025-06-22Motion-R1: Chain-of-Thought Reasoning and Reinforcement Learning for Human Motion Generation2025-06-12DanceChat: Large Language Model-Guided Music-to-Dance Generation2025-06-12MotionRAG-Diff: A Retrieval-Augmented Diffusion Framework for Long-Term Music-to-Dance Generation2025-06-03MotionPro: A Precise Motion Controller for Image-to-Video Generation2025-05-26