Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

EleAtt-RNN: Adding Attentiveness to Neurons in Recurrent Neural Networks

Pengfei Zhang, Jianru Xue, Cuiling Lan, Wen-Jun Zeng, Zhanning Gao, Nanning Zheng

2019-09-03 · Skeleton Based Action Recognition · Gesture Recognition · Action Recognition
Paper · PDF

Abstract

Recurrent neural networks (RNNs) are capable of modeling temporal dependencies in complex sequential data. In general, currently available RNN structures tend to concentrate on controlling the contributions of current and previous information, while the different levels of importance of the individual elements within an input vector are usually ignored. We propose a simple yet effective Element-wise-Attention Gate (EleAttG), which can easily be added to an RNN block (e.g., all RNN neurons in an RNN layer) to give the RNN neurons attentiveness. For an RNN block, an EleAttG adaptively modulates the input by assigning a different level of importance, i.e., attention, to each element/dimension of the input. We refer to an RNN block equipped with an EleAttG as an EleAtt-RNN block. Rather than modulating the input as a whole, the EleAttG modulates it at fine granularity, i.e., element-wise, and the modulation is content adaptive. The proposed EleAttG, as an additional fundamental unit, is general and can be applied to any RNN structure, e.g., the standard RNN, Long Short-Term Memory (LSTM), or Gated Recurrent Unit (GRU). We demonstrate the effectiveness of EleAtt-RNN by applying it to several tasks, including action recognition from both skeleton-based data and RGB videos, gesture recognition, and sequential MNIST classification. Experiments show that adding attentiveness to RNN blocks through EleAttGs significantly improves the power of RNNs.
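The gate described in the abstract can be sketched in a few lines: an attention vector, computed from the current input and the previous hidden state and squashed through a sigmoid, rescales each input dimension before the ordinary GRU update. The NumPy sketch below is a minimal illustration of that idea under assumed shapes, not the authors' implementation; all parameter names (`Wa`, `Ua`, `ba`, etc.) are hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def eleatt_gru_step(x, h_prev, p):
    """One step of a GRU cell preceded by an Element-wise-Attention Gate.

    The EleAttG computes a per-dimension attention vector from the current
    input and the previous hidden state, then modulates the input
    element-wise before the standard GRU update. Parameter names and
    shapes here are illustrative, not taken from the paper.
    """
    # Element-wise-Attention Gate: sigmoid keeps each weight in (0, 1),
    # so every input dimension is scaled by its own attention value.
    a = sigmoid(p["Wa"] @ x + p["Ua"] @ h_prev + p["ba"])
    x_mod = a * x  # fine-grained, content-adaptive modulation of the input

    # Standard GRU equations, applied to the modulated input
    z = sigmoid(p["Wz"] @ x_mod + p["Uz"] @ h_prev + p["bz"])  # update gate
    r = sigmoid(p["Wr"] @ x_mod + p["Ur"] @ h_prev + p["br"])  # reset gate
    h_tilde = np.tanh(p["Wh"] @ x_mod + p["Uh"] @ (r * h_prev) + p["bh"])
    return (1 - z) * h_prev + z * h_tilde

# Tiny usage example with random parameters.
dx, dh = 4, 3  # input and hidden dimensions
rng = np.random.default_rng(0)
p = {
    # EleAttG parameters map to the input dimension dx
    "Wa": rng.standard_normal((dx, dx)),
    "Ua": rng.standard_normal((dx, dh)),
    "ba": np.zeros(dx),
    # GRU parameters map to the hidden dimension dh
    "Wz": rng.standard_normal((dh, dx)), "Uz": rng.standard_normal((dh, dh)), "bz": np.zeros(dh),
    "Wr": rng.standard_normal((dh, dx)), "Ur": rng.standard_normal((dh, dh)), "br": np.zeros(dh),
    "Wh": rng.standard_normal((dh, dx)), "Uh": rng.standard_normal((dh, dh)), "bh": np.zeros(dh),
}
x = rng.standard_normal(dx)
h = eleatt_gru_step(x, np.zeros(dh), p)  # new hidden state, shape (dh,)
```

Because the gate only rescales the input vector, the same few lines drop in front of a standard RNN or LSTM cell unchanged, which is what makes the unit general.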

Results

Task | Dataset | Metric | Value | Model
Video | NTU RGB+D | Accuracy (CS) | 80.7 | EleAtt-GRU (aug.)
Video | NTU RGB+D | Accuracy (CV) | 88.4 | EleAtt-GRU (aug.)
Temporal Action Localization | NTU RGB+D | Accuracy (CS) | 80.7 | EleAtt-GRU (aug.)
Temporal Action Localization | NTU RGB+D | Accuracy (CV) | 88.4 | EleAtt-GRU (aug.)
Zero-Shot Learning | NTU RGB+D | Accuracy (CS) | 80.7 | EleAtt-GRU (aug.)
Zero-Shot Learning | NTU RGB+D | Accuracy (CV) | 88.4 | EleAtt-GRU (aug.)
Activity Recognition | NTU RGB+D | Accuracy (CS) | 80.7 | EleAtt-GRU (aug.)
Activity Recognition | NTU RGB+D | Accuracy (CV) | 88.4 | EleAtt-GRU (aug.)
Action Localization | NTU RGB+D | Accuracy (CS) | 80.7 | EleAtt-GRU (aug.)
Action Localization | NTU RGB+D | Accuracy (CV) | 88.4 | EleAtt-GRU (aug.)
Action Detection | NTU RGB+D | Accuracy (CS) | 80.7 | EleAtt-GRU (aug.)
Action Detection | NTU RGB+D | Accuracy (CV) | 88.4 | EleAtt-GRU (aug.)
3D Action Recognition | NTU RGB+D | Accuracy (CS) | 80.7 | EleAtt-GRU (aug.)
3D Action Recognition | NTU RGB+D | Accuracy (CV) | 88.4 | EleAtt-GRU (aug.)
Action Recognition | NTU RGB+D | Accuracy (CS) | 80.7 | EleAtt-GRU (aug.)
Action Recognition | NTU RGB+D | Accuracy (CV) | 88.4 | EleAtt-GRU (aug.)

Related Papers

Efficient Deployment of Spiking Neural Networks on SpiNNaker2 for DVS Gesture Recognition Using Neuromorphic Intermediate Representation (2025-09-04)
A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains (2025-07-17)
Visual Hand Gesture Recognition with Deep Learning: A Comprehensive Review of Methods, Datasets, Challenges and Future Research Directions (2025-07-06)
Zero-shot Skeleton-based Action Recognition with Prototype-guided Feature Alignment (2025-07-01)
EgoAdapt: Adaptive Multisensory Distillation and Policy Learning for Efficient Egocentric Perception (2025-06-26)
How do Foundation Models Compare to Skeleton-Based Approaches for Gesture Recognition in Human-Robot Interaction? (2025-06-25)
Feature Hallucination for Self-supervised Action Recognition (2025-06-25)
CARMA: Context-Aware Situational Grounding of Human-Robot Group Interactions by Combining Vision-Language Models with Object and Action Recognition (2025-06-25)