How2
Audio, Texts, Videos | CC BY-SA 4.0 | Introduced 2018-01-01
The How2 dataset contains 13,500 videos (300 hours of speech) and is split into 185,187 training, 2,022 development (dev), and 2,361 test utterances. It includes English subtitles and crowdsourced Portuguese translations.
Source: exploring multiview correlations in open-domain videos
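As a quick sanity check on the split sizes quoted in the description, the short sketch below computes each split's share of the utterances and a rough mean utterance length. The counts and the 300-hour figure come from the description; the assumption that the 300 hours spans all three splits is ours and is not stated in the source, so the per-utterance duration should be read as an estimate only.

```python
# Utterance counts per split, as stated in the dataset description.
splits = {"train": 185_187, "dev": 2_022, "test": 2_361}

total = sum(splits.values())  # all utterances across the three splits

# Share of each split as a percentage of all utterances.
shares = {name: 100 * count / total for name, count in splits.items()}

# Rough mean utterance length in seconds.
# ASSUMPTION: the stated 300 hours of speech covers train, dev, and test together.
stated_hours = 300
mean_seconds = stated_hours * 3600 / total

for name, pct in shares.items():
    print(f"{name}: {splits[name]:,} utterances ({pct:.2f}%)")
print(f"total: {total:,} utterances, about {mean_seconds:.1f} s each")
```

The training split dominates by a wide margin, so dev and test scores rest on a comparatively small number of utterances.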
Related Benchmarks
How2 300h / Abstractive Text Summarization / ROUGE-L
How2 300h / Text Summarization / ROUGE-L
How2QA / Video Question Answering / Accuracy
How2QA / Zero-Shot Learning / Accuracy
How2Sign / 3D / L1 error
How2Sign / Sign Language Translation / BLEU
How2Sign / Video / FVD16
How2Sign / Video Generation / FVD16
How2Sign / Video Inpainting / L1 error