Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Audio Classification on ESC-50

Metric: Accuracy (5-fold) (higher is better)
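
As a rough illustration of the metric, the sketch below assumes the reported figure is the mean of the per-fold accuracies over the five held-out folds, which is the usual ESC-50 protocol; individual papers may aggregate differently. The function names `fold_accuracy` and `cross_validated_accuracy` are hypothetical and not taken from any listed codebase.

```python
from statistics import mean


def fold_accuracy(y_true, y_pred):
    """Percentage of predictions that match the labels for one held-out fold."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("labels and predictions must be non-empty and of equal length")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return 100.0 * correct / len(y_true)


def cross_validated_accuracy(folds):
    """Mean of per-fold accuracies.

    `folds` is a list of (y_true, y_pred) pairs, one pair per held-out fold.
    Assumption: each fold is weighted equally, regardless of its size.
    """
    return mean([fold_accuracy(t, p) for t, p in folds])
```

With five folds, `cross_validated_accuracy` would be called with five `(y_true, y_pred)` pairs, each produced by a model trained on the other four folds.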

Results

| # | Model | Accuracy (5-fold) | Extra Data | Paper | Date | Code |
|---|-------|-------------------|------------|-------|------|------|
| 1 | OmniVec2 | 99.1 | Yes | - | - | - |
| 2 | InternVideo2 | 98.6 | Yes | InternVideo2: Scaling Foundation Models for Mult... | 2024-03-22 | Code |
| 3 | M2D2 AS+ | 98.5 | Yes | M2D2: Exploring General-purpose Audio-Language R... | 2025-03-28 | Code |
| 4 | OmniVec | 98.4 | Yes | OmniVec: Learning robust representations with cr... | 2023-11-07 | - |
| 5 | BEATs | 98.1 | Yes | BEATs: Audio Pre-Training with Acoustic Tokenizers | 2022-12-18 | Code |
| 6 | mn40_as | 97.45 | Yes | Efficient Large-scale Audio Tagging via Transfor... | 2022-11-09 | Code |
| 7 | DyMN-L | 97.4 | Yes | Dynamic Convolutional Neural Networks as Efficie... | 2023-10-24 | Code |
| 8 | M2D-CLAP/0.7 | 97.4 | Yes | M2D-CLAP: Masked Modeling Duo Meets CLAP for Lea... | 2024-06-04 | Code |
| 9 | M2D-AS/0.7 | 97.2 | Yes | Masked Modeling Duo: Towards a Universal Audio P... | 2024-04-09 | Code |
| 10 | HTS-AT | 97 | Yes | HTS-AT: A Hierarchical Token-Semantic Audio Tran... | 2022-02-02 | Code |
| 11 | EAT-M | 96.3 | Yes | End-to-End Audio Strikes Back: Boosting Augmenta... | 2022-04-25 | Code |
| 12 | ERANN-2-5 | 96.1 | No | - | - | - |
| 13 | M2D/0.7 | 96 | Yes | Masked Modeling Duo: Towards a Universal Audio P... | 2024-04-09 | Code |
| 14 | EAT | 96 | Yes | EAT: Self-Supervised Pre-Training with Efficient... | 2024-01-07 | Code |
| 15 | Audio Spectrogram Transformer | 95.7 | Yes | AST: Audio Spectrogram Transformer | 2021-04-05 | Code |
| 16 | EAT-S | 95.25 | Yes | End-to-End Audio Strikes Back: Boosting Augmenta... | 2022-04-25 | Code |
| 17 | MATPAC (SSL model, linear eval) | 93.5 | No | Masked Latent Prediction and Classification for ... | 2025-02-17 | Code |
| 18 | EAT-S (scratch) | 92.15 | No | End-to-End Audio Strikes Back: Boosting Augmenta... | 2022-04-25 | Code |
| 19 | SepTr + LeRaC | 91.58 | No | Learning Rate Curriculum | 2022-05-18 | Code |
| 20 | Multi-Channel Audio Feature with CNN | 89.5 | No | - | - | - |
| 21 | ACDNet | 87.1 | No | Environmental Sound Classification on the Edge: ... | 2021-03-05 | Code |