Action Segmentation with Mixed Temporal Domain Adaptation

Min-Hung Chen, Baopu Li, Yingze Bao, Ghassan AlRegib

2021-04-15Action Segmentation Domain Adaptation

Abstract

The main progress for action segmentation comes from densely-annotated data for fully-supervised learning. Since manual annotation for frame-level actions is time-consuming and challenging, we propose to exploit auxiliary unlabeled videos, which are much easier to obtain, by shaping this problem as a domain adaptation (DA) problem. Although various DA techniques have been proposed in recent years, most of them have been developed only for the spatial direction. Therefore, we propose Mixed Temporal Domain Adaptation (MTDA) to jointly align frame- and video-level embedded feature spaces across domains, and further integrate with the domain attention mechanism to focus on aligning the frame-level features with higher domain discrepancy, leading to more effective domain adaptation. Finally, we evaluate our proposed methods on three challenging datasets (GTEA, 50Salads, and Breakfast), and validate that MTDA outperforms the current state-of-the-art methods on all three datasets by large margins (e.g. 6.4% gain on F1@50 and 6.8% gain on the edit score for GTEA).

Results

Task	Dataset	Metric	Value	Model
Action Localization	50 Salads	Acc	83.2	DA
Action Localization	50 Salads	Edit	75.2	DA
Action Localization	50 Salads	F1@10%	82	DA
Action Localization	50 Salads	F1@25%	80.1	DA
Action Localization	50 Salads	F1@50%	72.5	DA
Action Localization	GTEA	Acc	80	DA
Action Localization	GTEA	Edit	85.8	DA
Action Localization	GTEA	F1@10%	90.5	DA
Action Localization	GTEA	F1@25%	88.4	DA
Action Localization	GTEA	F1@50%	76.2	DA
Action Localization	Breakfast	Acc	71	DA
Action Localization	Breakfast	Average F1	66.4	DA
Action Localization	Breakfast	Edit	73.6	DA
Action Localization	Breakfast	F1@10%	74.2	DA
Action Localization	Breakfast	F1@25%	68.6	DA
Action Localization	Breakfast	F1@50%	56.5	DA
Action Segmentation	50 Salads	Acc	83.2	DA
Action Segmentation	50 Salads	Edit	75.2	DA
Action Segmentation	50 Salads	F1@10%	82	DA
Action Segmentation	50 Salads	F1@25%	80.1	DA
Action Segmentation	50 Salads	F1@50%	72.5	DA
Action Segmentation	GTEA	Acc	80	DA
Action Segmentation	GTEA	Edit	85.8	DA
Action Segmentation	GTEA	F1@10%	90.5	DA
Action Segmentation	GTEA	F1@25%	88.4	DA
Action Segmentation	GTEA	F1@50%	76.2	DA
Action Segmentation	Breakfast	Acc	71	DA
Action Segmentation	Breakfast	Average F1	66.4	DA
Action Segmentation	Breakfast	Edit	73.6	DA
Action Segmentation	Breakfast	F1@10%	74.2	DA
Action Segmentation	Breakfast	F1@25%	68.6	DA
Action Segmentation	Breakfast	F1@50%	56.5	DA

Action Segmentation with Mixed Temporal Domain Adaptation

Abstract

Results

Related Papers

Action Segmentation with Mixed Temporal Domain Adaptation

Abstract

Results

Related Papers