TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Audio/Audio Generation/AudioCaps

Audio Generation on AudioCaps

Metric: IS (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕IS▼Extra DataPaperDate↕Code
1ETTA14.36NoETTA: Elucidating the Design Space of Text-to-Au...2024-12-26Code
2ETTA-FT-AC-100k14.29NoETTA: Elucidating the Design Space of Text-to-Au...2024-12-26Code
3Audiobox Sound12.7NoAudiobox: Unified Audio Generation with Natural ...2023-12-25-
4TangoFlux12.2NoTangoFlux: Super Fast and Faithful Text to Audio...2024-12-30Code
5Tango-AF&AC-FT-AC11.04NoImproving Text-To-Audio Models with Synthetic Ca...2024-06-18Code
6TangoFlux-base10.7NoTangoFlux: Super Fast and Faithful Text to Audio...2024-12-30Code
7AudioLDM2-large8.55NoAudioLDM 2: Learning Holistic Audio Generation w...2023-08-10Code