Visual Speech Recognition on LRS3-TED
Metric: Word Error Rate (WER) (lower is better)
LeaderboardDataset
Loading chart...
Results
Submit a result| # | Model↕ | Word Error Rate (WER)▲ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | CTC/Attention | 19.1 | Yes | Auto-AVSR: Audio-Visual Speech Recognition with ... | 2023-03-25 | Code |
| 2 | VTP with more data | 30.7 | Yes | Sub-word Level Lip Reading With Visual Attention | 2021-10-14 | - |
| 3 | VTP | 40.6 | Yes | Sub-word Level Lip Reading With Visual Attention | 2021-10-14 | - |