Text to Audio Retrieval on AudioCaps
Metric: R@5 (higher is better)
Results
| # | Model | R@5 | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | ONE-PEACE | 77.5 | Yes | ONE-PEACE: Exploring One General Representation ... | 2023-05-18 | Code |
| 2 | VAST | 76.8 | Yes | VAST: A Vision-Audio-Subtitle-Text Omni-Modality... | 2023-05-29 | Code |
| 3 | VALOR | 73.9 | Yes | VALOR: Vision-Audio-Language Omni-Perception Pre... | 2023-04-17 | Code |
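
For readers unfamiliar with the metric, R@5 (recall at 5) is the percentage of text queries for which the matching audio clip appears among the five highest-ranked results. The sketch below shows one way to compute it from a text-to-audio similarity matrix. The function name `recall_at_k` and the assumption that query `i` has exactly one correct clip at index `i` are illustrative choices, not taken from the leaderboard or the listed papers, whose evaluation protocols may differ.

```python
import numpy as np


def recall_at_k(similarity: np.ndarray, k: int = 5) -> float:
    """Percentage of text queries whose matching audio clip ranks in the top k.

    Assumption (illustrative): similarity[i, j] scores text query i against
    audio clip j, and the correct clip for query i is clip i.
    """
    n_queries = similarity.shape[0]
    # Indices of the k highest-scoring clips for each query (descending order).
    top_k = np.argsort(-similarity, axis=1)[:, :k]
    # A query counts as a hit if its own index appears among its top-k clips.
    targets = np.arange(n_queries)[:, None]
    hits = (top_k == targets).any(axis=1)
    return 100.0 * float(hits.mean())
```

Scores are reported as percentages, so a value such as 77.5 would mean that 77.5% of queries retrieved their target clip within the top five under this definition.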