TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Audio/Audio Generation/AudioCaps

Audio Generation on AudioCaps

Metric: KL_passt (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕KL_passt▼Extra DataPaperDate↕Code
1Stable Audio2.89NoFast Timing-Conditioned Latent Audio Diffusion2024-02-07Code
2Stable Audio 2.02.7NoLong-form music generation with latent diffusion2024-04-16Code
3Stable Audio Open2.14NoStable Audio Open2024-07-19Code
4AudioLDM2-large1.68NoAudioLDM 2: Learning Holistic Audio Generation w...2023-08-10Code
5AudioGen1.42NoAudioGen: Textually Guided Audio Generation2022-09-30Code
6TangoFlux-base1.23NoTangoFlux: Super Fast and Faithful Text to Audio...2024-12-30Code
7ETTA1.22NoETTA: Elucidating the Design Space of Text-to-Au...2024-12-26Code
8TangoFlux1.15NoTangoFlux: Super Fast and Faithful Text to Audio...2024-12-30Code
9ETTA-FT-AC-100k1.13NoETTA: Elucidating the Design Space of Text-to-Au...2024-12-26Code