AVA-Speech

Contains densely labeled speech activity in YouTube videos, with the goal of creating a shared, available dataset for this task.

Source: AVA-Speech: A Densely Labeled Dataset of Speech Activity in Movies