TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Methodology/Classification/VGGSound

Classification on VGGSound

Metric: Top 5 Accuracy (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Top 5 Accuracy▼AugmentationsPaperDate↕Code
1MMT (Audio-Visual)85.7No---
2MBT (AV)85.6NoAttention Bottlenecks for Multimodal Fusion2021-06-30Code
3AVT (Audio-Visual)85No---
4MAST (Audio Only)81.3NoMultiscale Audio Spectrogram Transformer for Eff...2023-03-19-
5PlayItBackX379.2NoPlay It Back: Iterative Attention for Audio Reco...2022-10-20Code
6MBT (A)78.1NoAttention Bottlenecks for Multimodal Fusion2021-06-30Code
7MMT (Video)77.9No---
8AVT (V)74.8No---
9MBT (V)72.6NoAttention Bottlenecks for Multimodal Fusion2021-06-30Code