TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Speech/Audio-Visual Speech Recognition/LRS2

Audio-Visual Speech Recognition on LRS2

Metric: Test WER (lower is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Test WER▲Extra DataPaperDate↕Code
1Whisper-Flamingo1.4YesWhisper-Flamingo: Integrating Visual Features in...2024-06-14Code
2CTC/Attention1.5YesAuto-AVSR: Audio-Visual Speech Recognition with ...2023-03-25Code
3MoCo + wav2vec (w/o extLM)2.6NoLeveraging Unimodal Self-Supervised Learning for...2022-02-24Code
4End2end Conformer3.7NoEnd-to-end Audio-visual Speech Recognition with ...2021-02-12Code
5LF-MMI TDNN5.9NoAudio-visual Recognition of Overlapped speech fo...2020-01-06-
6CTC/Attention7NoAudio-Visual Speech Recognition With A Hybrid CT...2018-09-28-
7TM-CTC8.2NoDeep Audio-Visual Speech Recognition2018-09-06Code
8TM-Seq2seq8.5NoDeep Audio-Visual Speech Recognition2018-09-06Code