LRW

Lip Reading in the Wild

AudioTextsVideosCustom (research-only, non-commercial, attribution)Introduced 2016-01-01

The Lip Reading in the Wild (LRW) dataset a large-scale audio-visual database that contains 500 different words from over 1,000 speakers. Each utterance has 29 frames, whose boundary is centered around the target word. The database is divided into training, validation and test sets. The training set contains at least 800 utterances for each class while the validation and test sets contain 50 utterances.

Source: Towards Pose-invariant Lip-Reading Image Source: https://www.robots.ox.ac.uk/~vgg/data/lip_reading/lrw1.html

Benchmarks

1 Image, 2*2 Stitchi/LMD 1 Image, 2*2 Stitchi/SSIM 10-shot image generation/FID 10-shot image generation/LSE-C 10-shot image generation/LSE-D 10-shot image generation/LMD 10-shot image generation/SSIM 3D/FID 3D/LSE-C 3D/LSE-D 3D/LMD 3D/SSIM 3D Face Modelling/FID 3D Face Modelling/LSE-C 3D Face Modelling/LSE-D 3D Face Modelling/LMD 3D Face Modelling/SSIM 3D Face Reconstruction/FID 3D Face Reconstruction/LSE-C 3D Face Reconstruction/LSE-D 3D Face Reconstruction/LMD 3D Face Reconstruction/SSIM Audio-Visual Speech Recognition/Top-1 Accuracy Face Generation/FID Face Generation/LSE-C Face Generation/LSE-D Face Generation/LMD Face Generation/SSIM Face Reconstruction/FID Face Reconstruction/LSE-C Face Reconstruction/LSE-D Face Reconstruction/LMD Face Reconstruction/SSIM Facial Recognition and Modelling/FID Facial Recognition and Modelling/LSE-C Facial Recognition and Modelling/LSE-D Facial Recognition and Modelling/LMD Facial Recognition and Modelling/SSIM Image Generation/FID Image Generation/LSE-C Image Generation/LSE-D Image Generation/LMD Image Generation/SSIM Keyword Spotting/Top-1 Accuracy Keyword Spotting/Top-5 Accuracy Keyword Spotting/mAP Lip Reading/WER Lip to Speech Synthesis/ESTOI Lip to Speech Synthesis/PESQ Lip to Speech Synthesis/STOI Lipreading/Top 1 Accuracy Natural Language Transduction/Top 1 Accuracy Speech Recognition/ESTOI Speech Recognition/PESQ Speech Recognition/STOI Talking Face Generation/LMD Talking Face Generation/SSIM Talking Head Generation/FID Talking Head Generation/LSE-C Talking Head Generation/LSE-D Visual Speech Recognition/ESTOI Visual Speech Recognition/PESQ Visual Speech Recognition/STOI

Related Benchmarks

LRW-1000/Lipreading/Top-1 Accuracy LRW-1000/Natural Language Transduction/Top-1 Accuracy