TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Audio/Text-to-Music Generation/MusicCaps

Text-to-Music Generation on MusicCaps

Metric: KL_passt (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕KL_passt▼Extra DataPaperDate↕Code
1UniAudio1.87No--Code
2AudioLDM2-music1.53NoAudioLDM 2: Learning Holistic Audio Generation w...2023-08-10Code
3Stable Audio Open1.32NoStable Audio Open2024-07-19Code
4OpenMusic (QA-MDT)1.31NoQuality-aware Masked Diffusion Transformer for E...2024-05-24Code
5MusicGen w/o melody (3.3B)1.31NoSimple and Controllable Music Generation2023-06-08Code
6MusicGen w/ random melody (1.5B)1.31NoSimple and Controllable Music Generation2023-06-08Code
7JEN-11.29NoJEN-1: Text-Guided Universal Music Generation wi...2023-08-09Code
8FLUXMusic1.25NoFLUX that Plays Music2024-09-01Code
9MusicGen w/o melody (1.5B)1.23NoSimple and Controllable Music Generation2023-06-08Code
10AudioLDM 2-Full1.2NoAudioLDM 2: Learning Holistic Audio Generation w...2023-08-10Code
11AudioLDM2-large1NoAudioLDM 2: Learning Holistic Audio Generation w...2023-08-10Code
12TANGO-AF0.94NoImproving Text-To-Audio Models with Synthetic Ca...2024-06-18Code
13MeLFusion (image-conditioned)0.89NoMeLFusion: Synthesizing Music from Image and Lan...2024-06-07Code
14ETTA0.84NoETTA: Elucidating the Design Space of Text-to-Au...2024-12-26Code
15Stable Audio0.8NoFast Timing-Conditioned Latent Audio Diffusion2024-02-07Code