Libri-Light

Public domain

Libri-Light is a collection of spoken English audio suitable for training speech recognition systems under limited or no supervision. It is derived from open-source audio books from the LibriVox project. It contains over 60K hours of audio.

Source: Libri-Light: A Benchmark for ASR with Limited or No Supervision