TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/ParCo: Part-Coordinating Text-to-Motion Synthesis

ParCo: Part-Coordinating Text-to-Motion Synthesis

Qiran Zou, Shangyuan Yuan, Shian Du, Yu Wang, Chang Liu, Yi Xu, Jie Chen, Xiangyang Ji

2024-03-27Motion Synthesis
PaperPDFCode(official)

Abstract

We study a challenging task: text-to-motion synthesis, aiming to generate motions that align with textual descriptions and exhibit coordinated movements. Currently, the part-based methods introduce part partition into the motion synthesis process to achieve finer-grained generation. However, these methods encounter challenges such as the lack of coordination between different part motions and difficulties for networks to understand part concepts. Moreover, introducing finer-grained part concepts poses computational complexity challenges. In this paper, we propose Part-Coordinating Text-to-Motion Synthesis (ParCo), endowed with enhanced capabilities for understanding part motions and communication among different part motion generators, ensuring a coordinated and fined-grained motion synthesis. Specifically, we discretize whole-body motion into multiple part motions to establish the prior concept of different parts. Afterward, we employ multiple lightweight generators designed to synthesize different part motions and coordinate them through our part coordination module. Our approach demonstrates superior performance on common benchmarks with economic computations, including HumanML3D and KIT-ML, providing substantial evidence of its effectiveness. Code is available at https://github.com/qrzou/ParCo .

Results

TaskDatasetMetricValueModel
Pose TrackingHumanML3DDiversity9.576ParCo
Pose TrackingHumanML3DFID0.109ParCo
Pose TrackingHumanML3DMultimodality1.382ParCo
Pose TrackingHumanML3DR Precision Top30.801ParCo
Pose TrackingKIT Motion-LanguageDiversity10.95ParCo
Pose TrackingKIT Motion-LanguageFID0.453ParCo
Pose TrackingKIT Motion-LanguageMultimodality1.245ParCo
Pose TrackingKIT Motion-LanguageR Precision Top30.772ParCo
Motion SynthesisHumanML3DDiversity9.576ParCo
Motion SynthesisHumanML3DFID0.109ParCo
Motion SynthesisHumanML3DMultimodality1.382ParCo
Motion SynthesisHumanML3DR Precision Top30.801ParCo
Motion SynthesisKIT Motion-LanguageDiversity10.95ParCo
Motion SynthesisKIT Motion-LanguageFID0.453ParCo
Motion SynthesisKIT Motion-LanguageMultimodality1.245ParCo
Motion SynthesisKIT Motion-LanguageR Precision Top30.772ParCo
10-shot image generationHumanML3DDiversity9.576ParCo
10-shot image generationHumanML3DFID0.109ParCo
10-shot image generationHumanML3DMultimodality1.382ParCo
10-shot image generationHumanML3DR Precision Top30.801ParCo
10-shot image generationKIT Motion-LanguageDiversity10.95ParCo
10-shot image generationKIT Motion-LanguageFID0.453ParCo
10-shot image generationKIT Motion-LanguageMultimodality1.245ParCo
10-shot image generationKIT Motion-LanguageR Precision Top30.772ParCo
3D Human Pose TrackingHumanML3DDiversity9.576ParCo
3D Human Pose TrackingHumanML3DFID0.109ParCo
3D Human Pose TrackingHumanML3DMultimodality1.382ParCo
3D Human Pose TrackingHumanML3DR Precision Top30.801ParCo
3D Human Pose TrackingKIT Motion-LanguageDiversity10.95ParCo
3D Human Pose TrackingKIT Motion-LanguageFID0.453ParCo
3D Human Pose TrackingKIT Motion-LanguageMultimodality1.245ParCo
3D Human Pose TrackingKIT Motion-LanguageR Precision Top30.772ParCo

Related Papers

DeepGesture: A conversational gesture synthesis system based on emotions and semantics2025-07-03VolumetricSMPL: A Neural Volumetric Body Model for Efficient Interactions, Contacts, and Collisions2025-06-29DuetGen: Music Driven Two-Person Dance Generation via Hierarchical Masked Modeling2025-06-23PlanMoGPT: Flow-Enhanced Progressive Planning for Text to Motion Synthesis2025-06-22Motion-R1: Chain-of-Thought Reasoning and Reinforcement Learning for Human Motion Generation2025-06-12DanceChat: Large Language Model-Guided Music-to-Dance Generation2025-06-12MotionRAG-Diff: A Retrieval-Augmented Diffusion Framework for Long-Term Music-to-Dance Generation2025-06-03MotionPro: A Precise Motion Controller for Image-to-Video Generation2025-05-26