Zipformer+pruned transducer (no external language model)
Reported on 3 benchmarks across 1 task · 2 papers · 2 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Audio3 results
- Word Error Rate (WER)· 2024-10-07SOTA10.09best: 9.12 (SAMBA ASR)
- Word Error Rate (WER)· 2024-10-07SOTA10.2best: 10.03 (Zipformer+pruned transducer w/ CR-CTC (no external language model))
- Word Error Rate (WER)· 2023-10-174.38best: 2.48 (SAMBA ASR)