Action Segmentation is a challenging problem in high-level video understanding. In its simplest form, Action Segmentation aims to split a temporally untrimmed video into contiguous temporal segments and to label each segment with one of a pre-defined set of action labels. The results of Action Segmentation can then be used as input to downstream applications such as video-to-text and action localization.
<span class="description-source">Source: TricorNet: A Hybrid Temporal Convolutional and Recurrent Network for Video Action Segmentation </span>
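
To make the output format concrete, here is a minimal sketch of how per-frame predictions can be collapsed into labeled temporal segments. It is an illustration only, not the method of the cited paper: the function name `frames_to_segments`, the half-open frame intervals, and the example labels ("pour", "stir", "background") are all assumptions made for this example.

```python
from itertools import groupby


def frames_to_segments(frame_labels):
    """Collapse per-frame labels into (start, end, label) segments.

    `start` is inclusive and `end` is exclusive, both in frame indices.
    This interval convention is an assumption chosen for the example.
    """
    segments = []
    start = 0
    # groupby yields one group per run of consecutive identical labels.
    for label, run in groupby(frame_labels):
        length = sum(1 for _ in run)
        segments.append((start, start + length, label))
        start += length
    return segments


# Hypothetical per-frame predictions for a 7-frame untrimmed video.
frame_labels = ["pour", "pour", "pour", "stir", "stir", "background", "pour"]
segments = frames_to_segments(frame_labels)
print(segments)
# [(0, 3, 'pour'), (3, 5, 'stir'), (5, 6, 'background'), (6, 7, 'pour')]
```

Each tuple is one segmented part of the video together with its action label, which is the kind of structured result a downstream task such as action localization could consume.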