AV-HuBERT Large
Reported on 4 benchmarks across 4 tasks · 2 papers · 4 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Audio · 1 result
- Word Error Rate (WER) · uses extra data · 2022-01-05 · SOTA · 1.3 · best: 0.68 (Whisper)
Speech · 1 result
- Word Error Rate (WER) · uses extra data · 2022-01-05 · SOTA · 1.4 · best: 0.74 (MMS-LLaMA)
Computer Vision · 1 result
- Word Error Rate (WER) · uses extra data · 2022-01-05 · SOTA · 26.9 · best: 12.8 (LP + Conformer)
Natural Language Processing · 1 result
- Word Error Rate (WER) · uses extra data · 2022-01-05 · SOTA · 26.9 · best: 12.8 (LP + Conformer)