TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/Unitary RNN

Unitary RNN

SequentialIntroduced 20003 papers
Source Paper

Description

A Unitary RNN is a recurrent neural network architecture that uses a unitary hidden to hidden matrix. Specifically they concern dynamics of the form:

h_t=f(Wh_t−1+Vx_t)h\_{t} = f\left(Wh\_{t−1} + Vx\_{t}\right)h_t=f(Wh_t−1+Vx_t)

where WWW is a unitary matrix (W†W=I)\left(W^{†}W = I\right)(W†W=I). The product of unitary matrices is a unitary matrix, so WWW can be parameterised as a product of simpler unitary matrices:

h_t=f(D_3R_2F−1D_2PR_1FD_1h_t−1+Vxt)h\_{t} = f\left(D\_{3}R\_{2}F^{−1}D\_{2}PR\_{1}FD\_{1}h\_{t−1} + Vxt\right)h_t=f(D_3R_2F−1D_2PR_1FD_1h_t−1+Vxt)

where D_3D\_{3}D_3, D_2D\_{2}D_2, D_1D\_{1}D_1 are learned diagonal complex matrices, and R_2R\_{2}R_2, R_1R\_{1}R_1 are learned reflection matrices. Matrices FFF and F−1F^{−1}F−1 are the discrete Fourier transformation and its inverse. P is any constant random permutation. The activation function f(h)f\left(h\right)f(h) applies a rectified linear unit with a learned bias to the modulus of each complex number. Only the diagonal and reflection matrices, DDD and RRR, are learned, so Unitary RNNs have fewer parameters than LSTMs with comparable numbers of hidden units.

Source: Associative LSTMs

Papers Using This Method

An AutoML-based Approach to Multimodal Image Sentiment Analysis2021-02-16Complex Unitary Recurrent Neural Networks using Scaled Cayley Transform2018-11-09Unitary Evolution Recurrent Neural Networks2015-11-20