Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Graph Convolutional Networks for Temporal Action Localization

Runhao Zeng, Wenbing Huang, Mingkui Tan, Yu Rong, Peilin Zhao, Junzhou Huang, Chuang Gan

Published: 2019-09-07 · ICCV 2019
Tasks: Action Classification · Action Localization · Temporal Action Localization
Links: Paper · PDF · Code (official)

Abstract

Most state-of-the-art action localization systems process each action proposal individually, without explicitly exploiting their relations during learning. However, the relations between proposals actually play an important role in action localization, since a meaningful action always consists of multiple proposals in a video. In this paper, we propose to exploit the proposal-proposal relations using Graph Convolutional Networks (GCNs). First, we construct an action proposal graph, where each proposal is represented as a node and the relation between two proposals as an edge. Here, we use two types of relations, one for capturing the context information for each proposal and the other one for characterizing the correlations between distinct actions. Then we apply the GCNs over the graph to model the relations among different proposals and learn powerful representations for action classification and localization. Experimental results show that our approach significantly outperforms the state-of-the-art on THUMOS14 (49.1% versus 42.8%). Moreover, augmentation experiments on ActivityNet also verify the efficacy of modeling action proposal relationships. Code is available at https://github.com/Alvin-Zeng/PGCN.
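The core idea in the abstract — proposals as graph nodes, relations as edges, GCN layers to propagate features between related proposals — can be illustrated with a minimal numpy sketch. This is not the paper's P-GCN implementation (which uses two relation types and learned per-type aggregation); it is a simplified version that builds only the overlap-based contextual edges, with `iou_thresh` and the function names chosen here for illustration:

```python
import numpy as np

def t_iou(a, b):
    """Temporal IoU between two proposals given as (start, end)."""
    inter = max(0.0, min(a[1], b[1]) - max(a[0], b[0]))
    union = (a[1] - a[0]) + (b[1] - b[0]) - inter
    return inter / union if union > 0 else 0.0

def build_proposal_graph(proposals, iou_thresh=0.1):
    """Adjacency matrix: connect proposals whose segments overlap
    (the 'contextual' relation); self-loops keep each node's own feature."""
    n = len(proposals)
    A = np.eye(n)
    for i in range(n):
        for j in range(i + 1, n):
            if t_iou(proposals[i], proposals[j]) > iou_thresh:
                A[i, j] = A[j, i] = 1.0
    return A

def gcn_layer(X, A, W):
    """One graph convolution: row-normalized neighbor averaging,
    a linear map, then ReLU."""
    D_inv = np.diag(1.0 / A.sum(axis=1))
    return np.maximum(D_inv @ A @ X @ W, 0.0)

# Toy usage: two overlapping proposals and one distant proposal.
proposals = [(0.0, 2.0), (1.0, 3.0), (10.0, 12.0)]
A = build_proposal_graph(proposals)
X = np.ones((3, 4))            # per-proposal features (e.g. pooled I3D)
W = np.ones((4, 2))            # layer weights
H = gcn_layer(X, A, W)         # relation-aware proposal features
```

After one such layer, each overlapping proposal's representation mixes in its neighbors' features, which is the mechanism the paper exploits for classification and boundary refinement.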

Results

Task | Dataset | Metric | Value | Model
Temporal Action Localization | ActivityNet-1.3 | mAP | 31.11 | P-GCN
Temporal Action Localization | ActivityNet-1.3 | mAP IoU@0.5 | 48.26 | P-GCN
Temporal Action Localization | ActivityNet-1.3 | mAP IoU@0.75 | 33.16 | P-GCN
Temporal Action Localization | ActivityNet-1.3 | mAP IoU@0.95 | 3.27 | P-GCN
Temporal Action Localization | THUMOS'14 | mAP IoU@0.1 | 69.5 | P-GCN
Temporal Action Localization | THUMOS'14 | mAP IoU@0.2 | 67.8 | P-GCN
Temporal Action Localization | THUMOS'14 | mAP IoU@0.3 | 63.6 | P-GCN
Temporal Action Localization | THUMOS'14 | mAP IoU@0.4 | 57.8 | P-GCN
Temporal Action Localization | THUMOS'14 | mAP IoU@0.5 | 49.1 | P-GCN

The same P-GCN results are also listed under the Video, Zero-Shot Learning, and Action Localization benchmark entries.
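The mAP IoU@k metrics above score a detection as correct when its temporal IoU with an unmatched ground-truth segment reaches the threshold k, then average precision over the ranked predictions. A minimal single-class sketch of that computation (not the official ActivityNet/THUMOS evaluation code; `average_precision` and its greedy matching are simplified here for illustration):

```python
import numpy as np

def t_iou(p, g):
    """Temporal IoU between a prediction and a ground-truth segment."""
    inter = max(0.0, min(p[1], g[1]) - max(p[0], g[0]))
    union = (p[1] - p[0]) + (g[1] - g[0]) - inter
    return inter / union if union > 0 else 0.0

def average_precision(preds, gts, iou_thresh):
    """preds: list of (start, end, score); gts: list of (start, end).
    Greedily match each prediction, in score order, to the best unused
    ground truth at tIoU >= iou_thresh, then integrate precision over recall."""
    preds = sorted(preds, key=lambda p: -p[2])
    used = [False] * len(gts)
    tp = np.zeros(len(preds))
    fp = np.zeros(len(preds))
    for i, p in enumerate(preds):
        best, best_iou = -1, iou_thresh
        for j, g in enumerate(gts):
            iou = t_iou(p, g)
            if not used[j] and iou >= best_iou:
                best, best_iou = j, iou
        if best >= 0:
            used[best] = True
            tp[i] = 1
        else:
            fp[i] = 1
    tp_c, fp_c = np.cumsum(tp), np.cumsum(fp)
    recall = tp_c / max(len(gts), 1)
    precision = tp_c / np.maximum(tp_c + fp_c, 1e-9)
    # Area under the precision-recall curve (step integration).
    ap, prev_r = 0.0, 0.0
    for r, p in zip(recall, precision):
        ap += (r - prev_r) * p
        prev_r = r
    return ap
```

The benchmark mAP is this AP averaged over action classes; ActivityNet additionally averages over ten tIoU thresholds from 0.5 to 0.95 to produce the single "mAP" number.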

Related Papers

DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition (2025-07-16)
Including Semantic Information via Word Embeddings for Skeleton-based Action Recognition (2025-06-23)
SurgBench: A Unified Large-Scale Benchmark for Surgical Video Analysis (2025-06-09)
From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos (2025-06-05)
Zero-Shot Temporal Interaction Localization for Egocentric Videos (2025-06-04)
A Review on Coarse to Fine-Grained Animal Action Recognition (2025-06-01)
LLM-powered Query Expansion for Enhancing Boundary Prediction in Language-driven Action Localization (2025-05-30)
Spatio-Temporal Joint Density Driven Learning for Skeleton-Based Action Recognition (2025-05-29)