Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Audio-visual Question Answering on MUSIC-AVQA v2.0

Metric: Accuracy (higher is better)

Results

| # | Model | Accuracy | Extra Data | Paper | Date | Code |
|---|-------|----------|------------|-------|------|------|
| 1 | Meerkat | 79.15 | Yes | Meerkat: Audio-Visual Large Language Model for G... | 2024-07-01 | Code |
| 2 | QA-TIGER | 76.43 | No | Question-Aware Gaussian Experts for Audio-Visual... | 2025-03-06 | Code |
| 3 | LAST-Att | 75.44 | No | Tackling Data Bias in MUSIC-AVQA: Crafting a Bal... | 2023-10-10 | Code |
| 4 | LAVISH | 73.18 | No | Vision Transformers are Parameter-Efficient Audi... | 2022-12-15 | Code |
| 5 | AVST | 71.02 | No | Learning to Answer Questions in Dynamic Audio-Vi... | 2022-03-26 | Code |