CNN + Bi-RNN + CTC (speech to letters), 25.9% WER if trainedonlyon SWB
Reported on 2 benchmarks across 1 task · 1 paper · 1 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Audio2 results
- Percentage error· 2014-12-17SOTA16best: 6.8 (IBM (LSTM+Conformer encoder-decoder))
- Percentage error· 2014-12-1712.6best: 4.3 (IBM (LSTM+Conformer encoder-decoder))