Metric: Google Speech Commands V2 12 (higher is better)
| # | Model↕ | Google Speech Commands V2 12▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | WaveFormer | 98.8 | No | - | - | - |
| 2 | BC-ResNet-8 | 98.7 | No | Broadcasted Residual Learning for Efficient Keyw... | 2021-06-08 | Code |
| 3 | Wav2KWS | 98.5 | No | - | - | Code |
| 4 | TripletLoss-res15 | 98.37 | No | Learning Efficient Representations for Keyword S... | 2021-01-12 | Code |
| 5 | ConvMixer | 98.2 | No | - | - | Code |
| 6 | EdgeCRNN 2.0× | 98.05 | No | - | - | - |
| 7 | MHAtt-RNN | 98 | No | - | - | Code |
| 8 | Embedding + Head | 97.7 | No | Training Keyword Spotters with Limited and Synth... | 2020-01-31 | Code |
| 9 | MatchboxNet-3x2x64 | 97.63 | No | MatchboxNet: 1D Time-Channel Separable Convoluti... | 2020-04-21 | Code |
| 10 | Head without Embedding | 97.4 | No | Training Keyword Spotters with Limited and Synth... | 2020-01-31 | Code |
| 11 | Attention RNN | 96.9 | No | A neural attention model for speech command reco... | 2018-08-27 | Code |
| 12 | TC-ResNet14-1.5 | 96.6 | No | Temporal Convolution for Real-time Keyword Spott... | 2019-04-08 | Code |
| 13 | End-to-end KWS model | 95.55 | No | End-to-end Keyword Spotting using Neural Archite... | 2021-04-14 | - |
| 14 | MicroNet-KWS-L | 95.3 | No | MicroNets: Neural Network Architectures for Depl... | 2020-10-21 | Code |