Papers With Code 2


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Richly Activated Graph Convolutional Network for Action Recognition with Incomplete Skeletons

Yi-Fan Song, Zhang Zhang, Liang Wang

Published: 2019-05-16
Tasks: Skeleton Based Action Recognition, Action Recognition, Temporal Action Localization

Abstract

Current methods for skeleton-based human action recognition usually work with completely observed skeletons. However, in real scenarios, captured skeletons are often incomplete and noisy, which deteriorates the performance of traditional models. To enhance the robustness of action recognition models to incomplete skeletons, we propose a multi-stream graph convolutional network (GCN) for exploring sufficient discriminative features distributed over all skeleton joints. Here, each stream of the network is only responsible for learning features from currently unactivated joints, which are distinguished by the class activation maps (CAM) obtained by preceding streams, so that the proposed method activates noticeably more joints than traditional methods. Thus, the proposed method is termed richly activated GCN (RA-GCN), where the richly discovered features improve the robustness of the model. Compared to the state-of-the-art methods, the RA-GCN achieves comparable performance on the NTU RGB+D dataset. Moreover, on a synthetic occlusion dataset, the RA-GCN significantly alleviates the performance deterioration.
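The stream-wise masking idea described above can be sketched in a few lines. This is a minimal NumPy illustration of the mechanism, not the authors' implementation: the function names, the 0.5 threshold, and the tensor shapes are assumptions. Each stream's class activation map marks some joints as "activated"; those joints are zeroed in the input to the next stream, forcing it to mine features from the remaining joints.

```python
import numpy as np

def cam_joint_mask(cam, threshold=0.5):
    """Mark joints whose (normalized) class activation exceeds the threshold.

    cam: per-joint activation scores, shape (num_joints,).
    Returns a boolean mask of 'activated' joints.
    """
    norm = (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)
    return norm >= threshold

def mask_input_for_next_stream(x, activated):
    """Zero out already-activated joints so the next stream must look elsewhere.

    x: skeleton features, shape (channels, frames, num_joints).
    """
    out = x.copy()
    out[:, :, activated] = 0.0
    return out

# Toy example: 3 joints; the first stream's CAM strongly activates joint 0.
rng = np.random.default_rng(0)
x = rng.normal(size=(3, 4, 3))     # (C, T, V)
cam = np.array([0.9, 0.2, 0.4])
activated = cam_joint_mask(cam)     # only joint 0 crosses the threshold
x2 = mask_input_for_next_stream(x, activated)
# x2 now has joint 0 zeroed out; joints 1 and 2 are untouched.
```

In the actual multi-stream model, this masking would be applied between streams during training, and the per-stream features are then combined for the final prediction.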

Results

All results are reported on the NTU RGB+D dataset:

Model     | Accuracy (CS) | Accuracy (CV)
3s RA-GCN | 85.9          | 93.5
2s RA-GCN | 85.8          | 93

The same results are mirrored across the following task leaderboards: Video, Temporal Action Localization, Zero-Shot Learning, Activity Recognition, Action Localization, Action Detection, 3D Action Recognition, and Action Recognition.

Related Papers

- A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains (2025-07-17)
- DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition (2025-07-16)
- Zero-shot Skeleton-based Action Recognition with Prototype-guided Feature Alignment (2025-07-01)
- EgoAdapt: Adaptive Multisensory Distillation and Policy Learning for Efficient Egocentric Perception (2025-06-26)
- Feature Hallucination for Self-supervised Action Recognition (2025-06-25)
- CARMA: Context-Aware Situational Grounding of Human-Robot Group Interactions by Combining Vision-Language Models with Object and Action Recognition (2025-06-25)
- Including Semantic Information via Word Embeddings for Skeleton-based Action Recognition (2025-06-23)
- Adapting Vision-Language Models for Evaluating World Models (2025-06-22)