Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Text-to-Music Generation on MusicCaps

Metric: FAD (Fréchet Audio Distance; lower is better)
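
For readers unfamiliar with the metric, the following is a minimal sketch of how a Fréchet distance between two sets of audio embeddings can be computed. It assumes each set is summarized by a Gaussian (mean and covariance) and that the embeddings come from some pretrained audio model (VGGish in the original FAD formulation). The function names `frechet_distance` and `embedding_stats` and the random stand-in data are illustrative assumptions; they are not taken from this page or from any listed paper's code, and individual leaderboard entries may use different embedding models or reference sets.

```python
import numpy as np


def embedding_stats(embeddings):
    """Return the mean vector and covariance matrix of an (n, d) embedding array."""
    mu = embeddings.mean(axis=0)
    sigma = np.cov(embeddings, rowvar=False)
    return mu, sigma


def frechet_distance(mu_r, sigma_r, mu_g, sigma_g):
    """Frechet distance between the Gaussians N(mu_r, sigma_r) and N(mu_g, sigma_g).

    d^2 = ||mu_r - mu_g||^2 + Tr(sigma_r) + Tr(sigma_g) - 2 Tr((sigma_r sigma_g)^(1/2))
    """
    diff = mu_r - mu_g
    # Tr((sigma_r sigma_g)^(1/2)) is the sum of the square roots of the
    # eigenvalues of the product; clip tiny negative values caused by round-off.
    eigvals = np.linalg.eigvals(sigma_r @ sigma_g)
    tr_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()
    return float(diff @ diff + np.trace(sigma_r) + np.trace(sigma_g) - 2.0 * tr_sqrt)


# Hypothetical stand-in data: random vectors in place of real audio embeddings.
rng = np.random.default_rng(0)
reference = rng.normal(size=(500, 8))
generated = rng.normal(loc=0.5, size=(500, 8))

fad_same = frechet_distance(*embedding_stats(reference), *embedding_stats(reference))
fad_shifted = frechet_distance(*embedding_stats(reference), *embedding_stats(generated))
print(fad_same, fad_shifted)
```

In this sketch the distance is approximately zero when both sets have identical statistics and grows as the mean or covariance of the generated set drifts away from the reference set.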

Results

Sorted by FAD, descending.

| # | Model | FAD | Extra Data | Paper | Date | Code |
|---|-------|-----|------------|-------|------|------|
| 1 | Riffusion | 13.4 | No | MusicLM: Generating Music From Text | 2023-01-26 | Code |
| 2 | Mubert | 9.6 | No | MusicLM: Generating Music From Text | 2023-01-26 | Code |
| 3 | MeLoDy | 5.41 | No | Efficient Neural Music Generation | 2023-05-25 | - |
| 4 | MusicGen w/ random melody (1.5B) | 5 | No | Simple and Controllable Music Generation | 2023-06-08 | Code |
| 5 | MusicLM | 4 | No | MusicLM: Generating Music From Text | 2023-01-26 | Code |
| 6 | Noise2Music spectrogram | 3.84 | No | Noise2Music: Text-conditioned Music Generation w... | 2023-02-08 | - |
| 7 | MusicGen w/o melody (3.3B) | 3.8 | No | Simple and Controllable Music Generation | 2023-06-08 | Code |
| 8 | UniAudio | 3.65 | No | - | - | Code |
| 9 | Stable Audio Open | 3.51 | No | Stable Audio Open | 2024-07-19 | Code |
| 10 | MusicGen w/o melody (1.5B) | 3.4 | No | Simple and Controllable Music Generation | 2023-06-08 | Code |
| 11 | AudioLDM 2-Full | 3.13 | No | AudioLDM 2: Learning Holistic Audio Generation w... | 2023-08-10 | Code |
| 12 | AudioLDM2-large | 2.93 | No | AudioLDM 2: Learning Holistic Audio Generation w... | 2023-08-10 | Code |
| 13 | TANGO-AF | 2.21 | No | Improving Text-To-Audio Models with Synthetic Ca... | 2024-06-18 | Code |
| 14 | Noise2Music waveform | 2.134 | No | Noise2Music: Text-conditioned Music Generation w... | 2023-02-08 | - |
| 15 | JEN-1 | 2 | No | JEN-1: Text-Guided Universal Music Generation wi... | 2023-08-09 | Code |
| 16 | ETTA | 1.91 | No | ETTA: Elucidating the Design Space of Text-to-Au... | 2024-12-26 | Code |
| 17 | OpenMusic (QA-MDT) | 1.65 | No | Quality-aware Masked Diffusion Transformer for E... | 2024-05-24 | Code |
| 18 | FLUXMusic | 1.43 | No | FLUX that Plays Music | 2024-09-01 | Code |
| 19 | MeLFusion (image-conditioned) | 1.12 | No | MeLFusion: Synthesizing Music from Image and Lan... | 2024-06-07 | Code |