3D Shape Generation on BEAT2

Metric: FGD (Fréchet Gesture Distance; lower is better)

Results

Sorted by FGD, descending.

#	Model	FGD	Extra Data	Paper	Date	Code
1	S2G	2.815	No	Learning Individual Styles of Conversational Gesture	2019-06-10	Code
2	Trimodal	1.241	No	Speech Gesture Generation from the Trimodal Context of Text, Audio, and Speaker Identity	2020-09-04	Code
3	HA2G	1.232	No	Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation	2022-03-24	Code
4	Habibie	0.904	No	Learning Speech-driven 3D Conversational Gestures from Video	2021-02-13	-
5	CaMN	0.6644	No	BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis	2022-03-10	Code
6	TalkShow	0.6209	No	Generating Holistic 3D Human Motion from Speech	2022-12-08	Code
7	EMAGE	0.5512	No	EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling	2023-12-31	Code
8	MambaTalk	0.5366	No	MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models	2024-03-14	Code
9	Syntalker	0.4687	No	Enabling Synergistic Full-Body Control in Prompt-Based Co-Speech Motion Generation	2024-10-01	Code
10	EchoMask	0.4623	No	EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation	2025-04-12	-
11	Contextual Gesture	0.4434	No	Contextual Gesture: Co-Speech Gesture Video Generation through Context-aware Gesture Representation	2025-02-11	-
12	SemTalk	0.4278	No	SemTalk: Holistic Co-speech Motion Generation with Frame-level Semantic Emphasis	2024-12-21	-
13	GestureLSM	0.404	No	GestureLSM: Latent Shortcut based Co-Speech Gesture Generation with Spatial-Temporal Modeling	2025-01-31	Code
14	Intentional Gesture	0.379	No	Intentional Gesture: Deliver Your Intentions with Gestures for Speech	2025-05-21	Code
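
FGD measures the distance between feature distributions of generated and reference gestures, so a smaller value indicates a closer match. The listing can therefore also be read in ascending FGD order to put the strongest result first. The following is a minimal Python sketch of that re-ranking, using the model names and FGD values from the listing; the `rows` list and the `rank_by_fgd` helper are illustrative names, not part of the leaderboard.

```python
# (model, FGD) pairs copied from the leaderboard listing.
rows = [
    ("S2G", 2.815),
    ("Trimodal", 1.241),
    ("HA2G", 1.232),
    ("Habibie", 0.904),
    ("CaMN", 0.6644),
    ("TalkShow", 0.6209),
    ("EMAGE", 0.5512),
    ("MambaTalk", 0.5366),
    ("Syntalker", 0.4687),
    ("EchoMask", 0.4623),
    ("Contextual Gesture", 0.4434),
    ("SemTalk", 0.4278),
    ("GestureLSM", 0.404),
    ("Intentional Gesture", 0.379),
]


def rank_by_fgd(entries):
    """Return entries sorted so the lowest FGD comes first."""
    return sorted(entries, key=lambda entry: entry[1])


ranked = rank_by_fgd(rows)
for position, (model, fgd) in enumerate(ranked, start=1):
    print(f"{position}. {model}: {fgd}")
```

Read this way, the first printed entry is the model with the smallest FGD and the last is the one with the largest.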