Metric: Accuracy (higher is better)
| # | Model | Accuracy | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | Meerkat | 79.15 | Yes | Meerkat: Audio-Visual Large Language Model for G... | 2024-07-01 | Code |
| 2 | QA-TIGER | 76.43 | No | Question-Aware Gaussian Experts for Audio-Visual... | 2025-03-06 | Code |
| 3 | LAST-Att | 75.44 | No | Tackling Data Bias in MUSIC-AVQA: Crafting a Bal... | 2023-10-10 | Code |
| 4 | LAVISH | 73.18 | No | Vision Transformers are Parameter-Efficient Audi... | 2022-12-15 | Code |
| 5 | AVST | 71.02 | No | Learning to Answer Questions in Dynamic Audio-Vi... | 2022-03-26 | Code |