VTP with more data
Reported on 4 benchmarks across 2 tasks · 1 paper · 4 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Audio2 results
- Word Error Rate (WER)· uses extra data· 2021-10-14SOTA30.7best: 0.68 (Whisper)
- Word Error Rate (WER)· uses extra data· 2021-10-14SOTA22.6best: 2.1 (RAVEn Large)
Speech2 results
- Word Error Rate (WER)· uses extra data· 2021-10-14SOTA30.7best: 19.1 (CTC/Attention)
- Word Error Rate (WER)· uses extra data· 2021-10-14SOTA22.6