TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Audio/Audio Generation/AudioCaps

Audio Generation on AudioCaps

Metric: FD_openl3 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕FD_openl3▼Extra DataPaperDate↕Code
1AudioGen185.53NoAudioGen: Textually Guided Audio Generation2022-09-30Code
2AudioLDM2-large158.04NoAudioLDM 2: Learning Holistic Audio Generation w...2023-08-10Code
3Stable Audio 2.0110.62NoLong-form music generation with latent diffusion2024-04-16Code
4Stable Audio103.66NoFast Timing-Conditioned Latent Audio Diffusion2024-02-07Code
5ETTA80.13NoETTA: Elucidating the Design Space of Text-to-Au...2024-12-26Code
6TangoFlux-base79.7NoTangoFlux: Super Fast and Faithful Text to Audio...2024-12-30Code
7Stable Audio Open78.24NoStable Audio Open2024-07-19Code
8TangoFlux75.1NoTangoFlux: Super Fast and Faithful Text to Audio...2024-12-30Code
9ETTA-FT-AC-100k61.79NoETTA: Elucidating the Design Space of Text-to-Au...2024-12-26Code