TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Audio/Speech Recognition/LRS2

Speech Recognition on LRS2

Metric: Test WER (lower is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Test WER▲Extra DataPaperDate↕Code
1Whisper1.3YesWhisper-Flamingo: Integrating Visual Features in...2024-06-14Code
2CTC/Attention1.5YesAuto-AVSR: Audio-Visual Speech Recognition with ...2023-03-25Code
3MoCo + wav2vec (w/o extLM)2.7NoLeveraging Unimodal Self-Supervised Learning for...2022-02-24Code
4End2end Conformer3.9NoEnd-to-end Audio-visual Speech Recognition with ...2021-02-12Code
5Whisper-LLaMA6.6NoWhispering LLaMA: A Cross-Modal Generative Error...2023-10-10Code
6LF-MMI TDNN6.7NoAudio-visual Recognition of Overlapped speech fo...2020-01-06-
7CTC/attention8.2NoAudio-Visual Speech Recognition With A Hybrid CT...2018-09-28-
8TM-seq2seq9.7NoDeep Audio-Visual Speech Recognition2018-09-06Code
9TM-CTC10.1NoDeep Audio-Visual Speech Recognition2018-09-06Code