TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Methodology/Classification/ESC-50

Classification on ESC-50

Metric: Top-1 Accuracy (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Top-1 Accuracy▼AugmentationsPaperDate↕Code
1OmniVec299.1Yes---
2InternVideo298.6YesInternVideo2: Scaling Foundation Models for Mult...2024-03-22Code
3M2D2 AS+98.5YesM2D2: Exploring General-purpose Audio-Language R...2025-03-28Code
4OmniVec98.4YesOmniVec: Learning robust representations with cr...2023-11-07-
5BEATs98.1YesBEATs: Audio Pre-Training with Acoustic Tokenizers2022-12-18Code
6mn40_as97.45YesEfficient Large-scale Audio Tagging via Transfor...2022-11-09Code
7DyMN-L97.4YesDynamic Convolutional Neural Networks as Efficie...2023-10-24Code
8M2D-CLAP/0.797.4YesM2D-CLAP: Masked Modeling Duo Meets CLAP for Lea...2024-06-04Code
9M2D-AS/0.797.2YesMasked Modeling Duo: Towards a Universal Audio P...2024-04-09Code
10HTS-AT97YesHTS-AT: A Hierarchical Token-Semantic Audio Tran...2022-02-02Code
11EAT-M96.3YesEnd-to-End Audio Strikes Back: Boosting Augmenta...2022-04-25Code
12LHGNN96.2NoLHGNN: Local-Higher Order Graph Neural Networks ...2025-01-07-
13ERANN-2-596.1No---
14M2D/0.796YesMasked Modeling Duo: Towards a Universal Audio P...2024-04-09Code
15EAT96YesEAT: Self-Supervised Pre-Training with Efficient...2024-01-07Code
16Audio Spectrogram Transformer95.7YesAST: Audio Spectrogram Transformer2021-04-05Code
17EAT-S95.25YesEnd-to-End Audio Strikes Back: Boosting Augmenta...2022-04-25Code
18MATPAC (SSL model, linear eval)93.5NoMasked Latent Prediction and Classification for ...2025-02-17Code
19EAT-S (scratch)92.15NoEnd-to-End Audio Strikes Back: Boosting Augmenta...2022-04-25Code
20SepTr + LeRaC91.58NoLearning Rate Curriculum2022-05-18Code
21SepTr91.13NoSepTr: Separable Transformer for Audio Spectrogr...2022-03-17Code
22Multi-Format Contrastive90.5YesMulti-Format Contrastive Learning of Audio Repre...2021-03-11-
23Multi-Channel Audio Feature with CNN89.5No---
24AVID89.2NoAudio-Visual Instance Discrimination with Cross-...2020-04-27Code
25ACDNet87.1NoEnvironmental Sound Classification on the Edge: ...2021-03-05Code
26XDC85.4NoSelf-Supervised Learning by Cross-Modal Audio-Vi...2019-11-28Code
27XDC84.8NoSelf-Supervised Learning by Cross-Modal Audio-Vi...2019-11-28Code
28AVTS82.3NoCooperative Learning of Audio and Video Models f...2018-06-30-
29L379.3NoLook, Listen and Learn2017-05-23Code