Audio-Visual Speech Recognition
6 benchmarks, 100 papers
Audio-visual speech recognition is the task of transcribing speech into text from paired audio and visual streams.