CNN + Bi-RNN + CTC (speech to letters)
Reported on 2 benchmarks across 1 task · 1 paper · 2 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Audio2 results
- Percentage error· 2014-12-17SOTA67.94best: 7.8 (Distortion-Independent + WRBN + Utterance-Wise Recurrent Dropout + Magnitude Features)
- Percentage error· 2014-12-17SOTA6.3best: 3.34 (Deep Speech 2)