TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Intentional Gesture: Deliver Your Intentions with Gestures...

Intentional Gesture: Deliver Your Intentions with Gestures for Speech

Pinxin Liu, Haiyang Liu, Luchuan Song, Chenliang Xu

2025-05-21Gesture Generation
PaperPDFCode

Abstract

When humans speak, gestures help convey communicative intentions, such as adding emphasis or describing concepts. However, current co-speech gesture generation methods rely solely on superficial linguistic cues (\textit{e.g.} speech audio or text transcripts), neglecting to understand and leverage the communicative intention that underpins human gestures. This results in outputs that are rhythmically synchronized with speech but are semantically shallow. To address this gap, we introduce \textbf{Intentional-Gesture}, a novel framework that casts gesture generation as an intention-reasoning task grounded in high-level communicative functions. % First, we curate the \textbf{InG} dataset by augmenting BEAT-2 with gesture-intention annotations (\textit{i.e.}, text sentences summarizing intentions), which are automatically annotated using large vision-language models. Next, we introduce the \textbf{Intentional Gesture Motion Tokenizer} to leverage these intention annotations. It injects high-level communicative functions (\textit{e.g.}, intentions) into tokenized motion representations to enable intention-aware gesture synthesis that are both temporally aligned and semantically meaningful, achieving new state-of-the-art performance on the BEAT-2 benchmark. Our framework offers a modular foundation for expressive gesture generation in digital humans and embodied AI. Project Page: https://andypinxinliu.github.io/Intentional-Gesture

Results

TaskDatasetMetricValueModel
3DBEAT2FGD0.379Intentional Gesture
3D Shape GenerationBEAT2FGD0.379Intentional Gesture

Related Papers

DeepGesture: A conversational gesture synthesis system based on emotions and semantics2025-07-03M3G: Multi-Granular Gesture Generator for Audio-Driven Full-Body Human Motion Synthesis2025-05-13Inter-Diffusion Generation Model of Speakers and Listeners for Effective Communication2025-05-08Co$^{3}$Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion2025-05-03EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation2025-04-12EasyGenNet: An Efficient Framework for Audio-Driven Gesture Video Generation Based on Diffusion Model2025-04-11Audio-driven Gesture Generation via Deviation Feature in the Latent Space2025-03-27SARGes: Semantically Aligned Reliable Gesture Generation via Intent Chain2025-03-26