TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Datasets/LRW

LRW

Lip Reading in the Wild

AudioTextsVideosCustom (research-only, non-commercial, attribution)Introduced 2016-01-01

The Lip Reading in the Wild (LRW) dataset a large-scale audio-visual database that contains 500 different words from over 1,000 speakers. Each utterance has 29 frames, whose boundary is centered around the target word. The database is divided into training, validation and test sets. The training set contains at least 800 utterances for each class while the validation and test sets contain 50 utterances.

Source: Towards Pose-invariant Lip-Reading Image Source: https://www.robots.ox.ac.uk/~vgg/data/lip_reading/lrw1.html

Benchmarks

1 Image, 2*2 Stitchi/LMD1 Image, 2*2 Stitchi/SSIM10-shot image generation/FID10-shot image generation/LSE-C10-shot image generation/LSE-D10-shot image generation/LMD10-shot image generation/SSIM3D/FID3D/LSE-C3D/LSE-D3D/LMD3D/SSIM3D Face Modelling/FID3D Face Modelling/LSE-C3D Face Modelling/LSE-D3D Face Modelling/LMD3D Face Modelling/SSIM3D Face Reconstruction/FID3D Face Reconstruction/LSE-C3D Face Reconstruction/LSE-D3D Face Reconstruction/LMD3D Face Reconstruction/SSIMAudio-Visual Speech Recognition/Top-1 AccuracyFace Generation/FIDFace Generation/LSE-CFace Generation/LSE-DFace Generation/LMDFace Generation/SSIMFace Reconstruction/FIDFace Reconstruction/LSE-CFace Reconstruction/LSE-DFace Reconstruction/LMDFace Reconstruction/SSIMFacial Recognition and Modelling/FIDFacial Recognition and Modelling/LSE-CFacial Recognition and Modelling/LSE-DFacial Recognition and Modelling/LMDFacial Recognition and Modelling/SSIMImage Generation/FIDImage Generation/LSE-CImage Generation/LSE-DImage Generation/LMDImage Generation/SSIMKeyword Spotting/Top-1 AccuracyKeyword Spotting/Top-5 AccuracyKeyword Spotting/mAPLip Reading/WERLip to Speech Synthesis/ESTOILip to Speech Synthesis/PESQLip to Speech Synthesis/STOILipreading/Top 1 AccuracyNatural Language Transduction/Top 1 AccuracySpeech Recognition/ESTOISpeech Recognition/PESQSpeech Recognition/STOITalking Face Generation/LMDTalking Face Generation/SSIMTalking Head Generation/FIDTalking Head Generation/LSE-CTalking Head Generation/LSE-DVisual Speech Recognition/ESTOIVisual Speech Recognition/PESQVisual Speech Recognition/STOI

Related Benchmarks

LRW-1000/Lipreading/Top-1 AccuracyLRW-1000/Natural Language Transduction/Top-1 Accuracy

Statistics

Papers
188
Benchmarks
63

Links

Homepage

Tasks

1 Image, 2*2 Stitchi10-shot image generation3D3D Face Modelling3D Face ReconstructionAudio-Visual Speech RecognitionFace GenerationFace ReconstructionFacial Recognition and ModellingImage GenerationKeyword SpottingLandmark-based LipreadingLip ReadingLip to Speech SynthesisLipreadingNatural Language TransductionSpeech RecognitionTalking Face GenerationTalking Head GenerationUnconstrained Lip-synchronizationVisual Keyword SpottingVisual Speech Recognition