Kinetics 400

Images | Creative Commons Attribution 4.0 International License | Introduced 2017-05-19

The dataset contains 400 human action classes, with at least 400 video clips for each action. Each clip lasts around 10 seconds and is taken from a different YouTube video. The actions are human-focused and cover a broad range of classes, including human-object interactions such as playing instruments and human-human interactions such as shaking hands.
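
The figures in the description imply a lower bound on the dataset's size. The short Python sketch below works through that arithmetic. It uses only the numbers stated above (400 classes, at least 400 clips per class, roughly 10 seconds per clip); the constant and function names are illustrative and not part of any official tooling, and the duration is approximate because clip length is only given as "around 10 seconds".

```python
# Figures taken from the dataset description; names are illustrative only.
NUM_CLASSES = 400            # human action classes
MIN_CLIPS_PER_CLASS = 400    # "at least 400" clips per class, so a lower bound
APPROX_CLIP_SECONDS = 10     # "around 10 seconds", so an approximation


def minimum_clip_count(num_classes=NUM_CLASSES, clips_per_class=MIN_CLIPS_PER_CLASS):
    """Lower bound on the total number of clips across all classes."""
    return num_classes * clips_per_class


def approximate_hours(clip_count, clip_seconds=APPROX_CLIP_SECONDS):
    """Rough total footage in hours, assuming every clip has the same length."""
    return clip_count * clip_seconds / 3600


total_clips = minimum_clip_count()
print(total_clips)                                # 160000
print(round(approximate_hours(total_clips), 1))   # 444.4
```

These are minimums: the actual clip count and total duration can only be higher, since some classes may have more than 400 clips.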

Source: https://arxiv.org/abs/1705.06950
