Metric: EmoV (higher is better)
| # | Model↕ | EmoV▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | M2D-CLAP | 61.9 | No | M2D2: Exploring General-purpose Audio-Language R... | 2025-03-28 | Code |
| 2 | Jukebox (Pre-training: CALM) | 61.7 | No | Codified audio language modeling learns useful r... | 2021-07-12 | Code |
| 3 | M2D | 59.4 | No | M2D2: Exploring General-purpose Audio-Language R... | 2025-03-28 | Code |
| 4 | M2D2 | 59.3 | No | M2D2: Exploring General-purpose Audio-Language R... | 2025-03-28 | Code |
| 5 | CLMR (Pre-training: contrastive) | 45.8 | No | Codified audio language modeling learns useful r... | 2021-07-12 | Code |