Metric: Google Speech Commands V2 35 (higher is better)
| # | Model↕ | Google Speech Commands V2 35▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | WaveFormer | 99.1 | No | - | - | - |
| 2 | QNN | 98.6 | No | - | - | Code |
| 3 | M2D | 98.5 | No | Masked Modeling Duo: Learning Representations by... | 2022-10-26 | Code |
| 4 | EAT-S | 98.15 | No | End-to-End Audio Strikes Back: Boosting Augmenta... | 2022-04-25 | Code |
| 5 | Audio Spectrogram Transformer | 98.11 | No | AST: Audio Spectrogram Transformer | 2021-04-05 | Code |
| 6 | HTS-AT | 98 | No | HTS-AT: A Hierarchical Token-Semantic Audio Tran... | 2022-02-02 | Code |
| 7 | KW-MLP | 97.56 | No | Attention-Free Keyword Spotting | 2021-10-14 | Code |
| 8 | SSAMBA | 97.4 | No | SSAMBA: Self-Supervised Audio Representation Lea... | 2024-05-20 | Code |
| 9 | TripletLoss-res15 | 97 | No | Learning Efficient Representations for Keyword S... | 2021-01-12 | Code |
| 10 | ImportantAug | 95 | No | ImportantAug: a data augmentation agent for speech | 2021-12-14 | Code |
| 11 | Attention RNN | 93.9 | No | A neural attention model for speech command reco... | 2018-08-27 | Code |