3D Shape Generation on BEAT2
Metric: FGD (Fréchet Gesture Distance; lower is better)
Results
| # | Model | FGD | Extra Data | Paper | Date | Code |
|---|-------|-----|------------|-------|------|------|
| 1 | Intentional Gesture | 0.379 | No | Intentional Gesture: Deliver Your Intentions with Gestures for Speech | 2025-05-21 | Yes |
| 2 | GestureLSM | 0.404 | No | GestureLSM: Latent Shortcut based Co-Speech Gesture Generation with Spatial-Temporal Modeling | 2025-01-31 | Yes |
| 3 | SemTalk | 0.4278 | No | SemTalk: Holistic Co-speech Motion Generation with Frame-level Semantic Emphasis | 2024-12-21 | - |
| 4 | Contextual Gesture | 0.4434 | No | Contextual Gesture: Co-Speech Gesture Video Generation through Context-aware Gesture Representation | 2025-02-11 | - |
| 5 | EchoMask | 0.4623 | No | EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation | 2025-04-12 | - |
| 6 | SynTalker | 0.4687 | No | Enabling Synergistic Full-Body Control in Prompt-Based Co-Speech Motion Generation | 2024-10-01 | Yes |
| 7 | MambaTalk | 0.5366 | No | MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models | 2024-03-14 | Yes |
| 8 | EMAGE | 0.5512 | No | EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling | 2023-12-31 | Yes |
| 9 | TalkShow | 0.6209 | No | Generating Holistic 3D Human Motion from Speech | 2022-12-08 | Yes |
| 10 | CaMN | 0.6644 | No | BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis | 2022-03-10 | Yes |
| 11 | Habibie | 0.904 | No | Learning Speech-driven 3D Conversational Gestures from Video | 2021-02-13 | - |
| 12 | HA2G | 1.232 | No | Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation | 2022-03-24 | Yes |
| 13 | Trimodal | 1.241 | No | Speech Gesture Generation from the Trimodal Context of Text, Audio, and Speaker Identity | 2020-09-04 | Yes |
| 14 | S2G | 2.815 | No | Learning Individual Styles of Conversational Gesture | 2019-06-10 | Yes |
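For readers unfamiliar with the metric, the sketch below illustrates the idea behind a Fréchet distance between two sets of feature vectors: fit a Gaussian to each set and compare means and spreads, so that identical distributions score 0. It is a simplified illustration, not the evaluation code used by any paper listed here. It assumes diagonal covariance so it can run on the standard library alone, whereas FGD as usually reported uses full covariance matrices (with a matrix square root) over features from a pretrained motion encoder. The function name `frechet_distance_diag` and the toy inputs are hypothetical.

```python
from math import sqrt
from statistics import fmean, pvariance


def frechet_distance_diag(real, generated):
    """Fréchet distance between two Gaussians with DIAGONAL covariance.

    `real` and `generated` are lists of equal-length feature vectors.
    Simplifying assumption: feature dimensions are treated as independent,
    so the trace term reduces to a sum of squared differences of the
    per-dimension standard deviations.
    """
    dims = len(real[0])
    total = 0.0
    for d in range(dims):
        r = [row[d] for row in real]
        g = [row[d] for row in generated]
        mu_r, mu_g = fmean(r), fmean(g)
        # Population standard deviation of each dimension.
        sd_r = sqrt(pvariance(r, mu_r))
        sd_g = sqrt(pvariance(g, mu_g))
        # Squared mean gap plus squared spread gap for this dimension.
        total += (mu_r - mu_g) ** 2 + (sd_r - sd_g) ** 2
    return total


if __name__ == "__main__":
    # Toy features: the second set is the first shifted by 1 along dimension 0.
    real_feats = [[0.0, 0.0], [2.0, 2.0]]
    gen_feats = [[1.0, 0.0], [3.0, 2.0]]
    print(frechet_distance_diag(real_feats, real_feats))
    print(frechet_distance_diag(real_feats, gen_feats))
```

In this toy case the two sets have the same spread and differ only by a unit shift in one dimension, so the score comes entirely from the mean term. A full implementation would replace the per-dimension loop with mean vectors, covariance matrices, and a matrix square root.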