PaSST-RoBERTa & Estimated Audio–Caption Correspondences
Reported on 4 benchmarks across 1 task · 1 paper · 4 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Audio4 results
- R@1· uses extra data· 2024-08-21SOTA27.69
- R@10· uses extra data· 2024-08-21SOTA70.39
- R@5· uses extra data· 2024-08-21SOTA57.03
- mAP@10· uses extra data· 2024-08-21SOTA40.14