TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/UnICORNN: A recurrent model for learning very long time de...

UnICORNN: A recurrent model for learning very long time dependencies

T. Konstantin Rusch, Siddhartha Mishra

2021-03-09Sentiment AnalysisSequential Image ClassificationTime Series AnalysisTime Series Classification
PaperPDFCode(official)

Abstract

The design of recurrent neural networks (RNNs) to accurately process sequential inputs with long-time dependencies is very challenging on account of the exploding and vanishing gradient problem. To overcome this, we propose a novel RNN architecture which is based on a structure preserving discretization of a Hamiltonian system of second-order ordinary differential equations that models networks of oscillators. The resulting RNN is fast, invertible (in time), memory efficient and we derive rigorous bounds on the hidden state gradients to prove the mitigation of the exploding and vanishing gradient problem. A suite of experiments are presented to demonstrate that the proposed RNN provides state of the art performance on a variety of learning tasks with (very) long-time dependencies.

Results

TaskDatasetMetricValueModel
Time Series ClassificationEigenWorms% Test Accuracy90.3UnICORNN
Time Series ClassificationEigenWorms% Test Accuracy86.7coRNN
Time Series ClassificationEigenWorms% Test Accuracy49.7IndRNN
Time Series ClassificationEigenWorms% Test Accuracy40expRNN
Sentiment AnalysisIMDbAccuracy88.4UnICORNN
Image Classificationnoise padded CIFAR-10% Test Accuracy62.4UnICORNN
Image ClassificationSequential MNISTPermuted Accuracy98.4UnICORNN

Related Papers

AdaptiSent: Context-Aware Adaptive Attention for Multimodal Aspect-Based Sentiment Analysis2025-07-17Emergence of Functionally Differentiated Structures via Mutual Information Optimization in Recurrent Neural Networks2025-07-17AI Wizards at CheckThat! 2025: Enhancing Transformer-Based Embeddings with Sentiment for Subjectivity Detection in News Articles2025-07-15DCR: Quantifying Data Contamination in LLMs Evaluation2025-07-15SentiDrop: A Multi Modal Machine Learning model for Predicting Dropout in Distance Learning2025-07-14GNN-CNN: An Efficient Hybrid Model of Convolutional and Graph Neural Networks for Text Representation2025-07-10Process mining-driven modeling and simulation to enhance fault diagnosis in cyber-physical systems2025-06-26FINN-GL: Generalized Mixed-Precision Extensions for FPGA-Accelerated LSTMs2025-06-25