Papers With Code

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


An Attention Enhanced Graph Convolutional LSTM Network for Skeleton-Based Action Recognition

Chenyang Si, Wentao Chen, Wei Wang, Liang Wang, Tieniu Tan

2019-02-25 · CVPR 2019
Tasks: Skeleton Based Action Recognition, Action Recognition, Temporal Action Localization

Abstract

Skeleton-based action recognition is an important task that requires an adequate understanding of the movement characteristics of a human action from the given skeleton sequence. Recent studies have shown that exploring spatial and temporal features of the skeleton sequence is vital for this task. Nevertheless, how to effectively extract discriminative spatial and temporal features is still a challenging problem. In this paper, we propose a novel Attention Enhanced Graph Convolutional LSTM Network (AGC-LSTM) for human action recognition from skeleton data. The proposed AGC-LSTM can not only capture discriminative features in spatial configuration and temporal dynamics but also explore the co-occurrence relationship between spatial and temporal domains. We also present a temporal hierarchical architecture to increase the temporal receptive field of the top AGC-LSTM layer, which boosts the ability to learn high-level semantic representations and significantly reduces the computation cost. Furthermore, to select discriminative spatial information, an attention mechanism is employed to enhance the information of key joints in each AGC-LSTM layer. Experimental results on two datasets are provided: the NTU RGB+D dataset and the Northwestern-UCLA dataset. The comparison results demonstrate the effectiveness of our approach and show that it outperforms the state-of-the-art methods on both datasets.
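The abstract describes two core ideas: replacing the dense transforms inside each LSTM gate with a graph convolution over the skeleton's joint graph, and applying a spatial attention mechanism that emphasizes key joints in each layer's hidden state. The sketch below illustrates one recurrent step of such a cell in plain numpy. It is an illustrative approximation, not the paper's implementation: the weight names (`Wi`, `Wf`, `Wo`, `Wg`, `wa`), the single-hop symmetric normalization, and the simple dot-product attention scoring are all assumptions made for the example.

```python
import numpy as np

def normalize_adjacency(A):
    # Symmetrically normalize adjacency with self-loops: D^-1/2 (A + I) D^-1/2,
    # the standard propagation rule for graph convolutions.
    A_hat = A + np.eye(A.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(A_hat.sum(axis=1))
    return A_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def agc_lstm_step(x, h, c, A_norm, params):
    """One recurrent step over a single frame of skeleton data.

    x: (N, F_in)  per-joint input features for this frame
    h, c: (N, F_hid) per-joint hidden and cell states
    A_norm: (N, N) normalized joint adjacency
    params: dict of weight matrices (hypothetical names)
    """
    # Graph convolution replaces the dense layer inside every gate:
    # aggregate each joint's neighborhood before the linear transform.
    z = np.concatenate([x, h], axis=1)        # (N, F_in + F_hid)
    gc = A_norm @ z                           # neighborhood aggregation
    i = sigmoid(gc @ params["Wi"])            # input gate
    f = sigmoid(gc @ params["Wf"])            # forget gate
    o = sigmoid(gc @ params["Wo"])            # output gate
    g = np.tanh(gc @ params["Wg"])            # candidate cell state
    c_new = f * c + i * g
    h_new = o * np.tanh(c_new)

    # Spatial attention: score each joint, softmax over joints, and
    # re-weight the hidden state so key joints are enhanced.
    scores = (h_new @ params["wa"]).ravel()   # (N,)
    alpha = np.exp(scores - scores.max())
    alpha /= alpha.sum()
    h_att = h_new * (1.0 + alpha[:, None])    # residual-style enhancement
    return h_att, c_new

# Minimal usage: 5 joints, a toy chain skeleton, random weights.
rng = np.random.default_rng(0)
N, F_in, F_hid = 5, 3, 8
A = np.zeros((N, N))
for a, b in [(0, 1), (1, 2), (2, 3), (3, 4)]:  # chain of joints
    A[a, b] = A[b, a] = 1.0
A_norm = normalize_adjacency(A)
params = {k: rng.standard_normal((F_in + F_hid, F_hid)) * 0.1
          for k in ("Wi", "Wf", "Wo", "Wg")}
params["wa"] = rng.standard_normal((F_hid, 1)) * 0.1
h = np.zeros((N, F_hid))
c = np.zeros((N, F_hid))
for _ in range(4):  # iterate over a few frames
    x = rng.standard_normal((N, F_in))
    h, c = agc_lstm_step(x, h, c, A_norm, params)
```

The temporal hierarchy from the abstract would sit on top of this: stacking such layers while pooling over time between them, so the top layer sees a larger temporal receptive field at lower cost.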

Results

Task | Dataset | Metric | Value | Model
Video | NTU RGB+D | Accuracy (CS) | 89.2 | AGC-LSTM (Joint&Part)
Video | NTU RGB+D | Accuracy (CV) | 95 | AGC-LSTM (Joint&Part)
Temporal Action Localization | NTU RGB+D | Accuracy (CS) | 89.2 | AGC-LSTM (Joint&Part)
Temporal Action Localization | NTU RGB+D | Accuracy (CV) | 95 | AGC-LSTM (Joint&Part)
Zero-Shot Learning | NTU RGB+D | Accuracy (CS) | 89.2 | AGC-LSTM (Joint&Part)
Zero-Shot Learning | NTU RGB+D | Accuracy (CV) | 95 | AGC-LSTM (Joint&Part)
Activity Recognition | NTU RGB+D | Accuracy (CS) | 89.2 | AGC-LSTM (Joint&Part)
Activity Recognition | NTU RGB+D | Accuracy (CV) | 95 | AGC-LSTM (Joint&Part)
Action Localization | NTU RGB+D | Accuracy (CS) | 89.2 | AGC-LSTM (Joint&Part)
Action Localization | NTU RGB+D | Accuracy (CV) | 95 | AGC-LSTM (Joint&Part)
Action Detection | NTU RGB+D | Accuracy (CS) | 89.2 | AGC-LSTM (Joint&Part)
Action Detection | NTU RGB+D | Accuracy (CV) | 95 | AGC-LSTM (Joint&Part)
3D Action Recognition | NTU RGB+D | Accuracy (CS) | 89.2 | AGC-LSTM (Joint&Part)
3D Action Recognition | NTU RGB+D | Accuracy (CV) | 95 | AGC-LSTM (Joint&Part)
Action Recognition | NTU RGB+D | Accuracy (CS) | 89.2 | AGC-LSTM (Joint&Part)
Action Recognition | NTU RGB+D | Accuracy (CV) | 95 | AGC-LSTM (Joint&Part)

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains (2025-07-17)
DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition (2025-07-16)
Zero-shot Skeleton-based Action Recognition with Prototype-guided Feature Alignment (2025-07-01)
EgoAdapt: Adaptive Multisensory Distillation and Policy Learning for Efficient Egocentric Perception (2025-06-26)
Feature Hallucination for Self-supervised Action Recognition (2025-06-25)
CARMA: Context-Aware Situational Grounding of Human-Robot Group Interactions by Combining Vision-Language Models with Object and Action Recognition (2025-06-25)
Including Semantic Information via Word Embeddings for Skeleton-based Action Recognition (2025-06-23)
Adapting Vision-Language Models for Evaluating World Models (2025-06-22)