Metric: FD (higher is better)
| # | Model | FD | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | Stable Audio Open | 36.42 | No | Stable Audio Open | 2024-07-19 | Code |
| 2 | TANGO-AF | 22.69 | No | Improving Text-To-Audio Models with Synthetic Ca... | 2024-06-18 | Code |
| 3 | MeLFusion (image-conditioned) | 22.65 | No | MeLFusion: Synthesizing Music from Image and Lan... | 2024-06-07 | Code |
| 4 | AudioLDM2-large | 16.34 | No | AudioLDM 2: Learning Holistic Audio Generation w... | 2023-08-10 | Code |
| 5 | ETTA | 10.06 | No | ETTA: Elucidating the Design Space of Text-to-Au... | 2024-12-26 | Code |