Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Audio
/
Text-to-Music Generation
/
MusicCaps
Text-to-Music Generation on MusicCaps
Metric: FAD (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Export CSV
#
Model
↕
FAD
▼
Extra Data
Paper
Date
↕
Code
1
Riffusion
13.4
No
MusicLM: Generating Music From Text
2023-01-26
Code
2
Mubert
9.6
No
MusicLM: Generating Music From Text
2023-01-26
Code
3
MeLoDy
5.41
No
Efficient Neural Music Generation
2023-05-25
-
4
MusicGen w/ random melody (1.5B)
5
No
Simple and Controllable Music Generation
2023-06-08
Code
5
MusicLM
4
No
MusicLM: Generating Music From Text
2023-01-26
Code
6
Noise2Music spectrogram
3.84
No
Noise2Music: Text-conditioned Music Generation w...
2023-02-08
-
7
MusicGen w/o melody (3.3B)
3.8
No
Simple and Controllable Music Generation
2023-06-08
Code
8
UniAudio
3.65
No
-
-
Code
9
Stable Audio Open
3.51
No
Stable Audio Open
2024-07-19
Code
10
MusicGen w/o melody (1.5B)
3.4
No
Simple and Controllable Music Generation
2023-06-08
Code
11
AudioLDM 2-Full
3.13
No
AudioLDM 2: Learning Holistic Audio Generation w...
2023-08-10
Code
12
AudioLDM2-large
2.93
No
AudioLDM 2: Learning Holistic Audio Generation w...
2023-08-10
Code
13
TANGO-AF
2.21
No
Improving Text-To-Audio Models with Synthetic Ca...
2024-06-18
Code
14
Noise2Music waveform
2.134
No
Noise2Music: Text-conditioned Music Generation w...
2023-02-08
-
15
JEN-1
2
No
JEN-1: Text-Guided Universal Music Generation wi...
2023-08-09
Code
16
ETTA
1.91
No
ETTA: Elucidating the Design Space of Text-to-Au...
2024-12-26
Code
17
OpenMusic (QA-MDT)
1.65
No
Quality-aware Masked Diffusion Transformer for E...
2024-05-24
Code
18
FLUXMusic
1.43
No
FLUX that Plays Music
2024-09-01
Code
19
MeLFusion (image-conditioned)
1.12
No
MeLFusion: Synthesizing Music from Image and Lan...
2024-06-07
Code