TV-AD

Audio Description dataset for TV series

Introduced 2024-07-22

TV-AD is a dataset of ground-truth audio description (AD) annotations aligned with TV series video. It covers episodes from multiple series, including “The Big Bang Theory”, “Friends”, “Frasier”, and “Seinfeld”. The dataset is divided into a training split (TV-AD-Train, ∼31k ADs) and an evaluation split (TV-AD-Eval, ∼3k ADs), with no TV series appearing in both. The evaluation split contains AD annotations for TV videos that are publicly available.
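
The series-level split described above can be illustrated with a small sketch. It is not the dataset's actual loader or file format: the record fields (`series`, `start`, `end`, `text`), the placeholder series names, and the helper functions `split_by_series` and `series_overlap` are all hypothetical, chosen only to show how a split with no shared series might be built and checked.

```python
from collections import defaultdict


def split_by_series(records, eval_series):
    """Partition AD records so that each series lands in exactly one split.

    `records` is a list of dicts with a hypothetical "series" key;
    `eval_series` is the set of series names reserved for evaluation.
    """
    splits = defaultdict(list)
    for rec in records:
        name = "eval" if rec["series"] in eval_series else "train"
        splits[name].append(rec)
    return splits["train"], splits["eval"]


def series_overlap(train, eval_):
    """Return the set of series names that appear in both splits."""
    return {r["series"] for r in train} & {r["series"] for r in eval_}


# Toy records with placeholder series names (not real TV-AD data).
records = [
    {"series": "Series A", "start": 12.0, "end": 14.5, "text": "She opens the door."},
    {"series": "Series A", "start": 30.2, "end": 32.0, "text": "He sits on the couch."},
    {"series": "Series B", "start": 5.1, "end": 7.3, "text": "They walk into a cafe."},
    {"series": "Series C", "start": 8.0, "end": 9.9, "text": "A man waves from a car."},
]

train, eval_ = split_by_series(records, eval_series={"Series C"})
overlap = series_overlap(train, eval_)
```

Splitting on the series name rather than on individual ADs or episodes is what keeps every series confined to a single split, so the overlap check returns an empty set by construction.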