Recurrent Independent Mechanisms

Anirudh Goyal, Alex Lamb, Jordan Hoffmann, Shagun Sodhani, Sergey Levine, Yoshua Bengio, Bernhard Schölkopf

2019-09-24 · ICLR 2021 · Atari Games

Abstract

Learning modular structures which reflect the dynamics of the environment can lead to better generalization and robustness to changes which only affect a few of the underlying causes. We propose Recurrent Independent Mechanisms (RIMs), a new recurrent architecture in which multiple groups of recurrent cells operate with nearly independent transition dynamics, communicate only sparingly through the bottleneck of attention, and are only updated at time steps where they are most relevant. We show that this leads to specialization amongst the RIMs, which in turn allows for dramatically improved generalization on tasks where some factors of variation differ systematically between training and evaluation.
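The three ingredients named in the abstract (near-independent dynamics per module, sparse activation by relevance, and communication through attention) can be illustrated with a toy NumPy sketch. This is not the authors' implementation: the class name `ToyRIMs`, the plain tanh recurrence, the all-zero "null" input that modules can attend to instead of the real input, and all dimensions are assumptions chosen for brevity.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

class ToyRIMs:
    """Hypothetical toy model: n_modules recurrent modules, top_k updated per step."""

    def __init__(self, n_modules=4, top_k=2, d_in=3, d_hid=5, d_key=4, seed=0):
        rng = np.random.default_rng(seed)
        s = 0.3  # assumed weight scale, for illustration only
        self.n, self.k, self.d_key = n_modules, top_k, d_key
        # Each module has its own parameters (leading axis n_modules),
        # so the transition dynamics are not shared across modules.
        self.Wq_in = rng.normal(0, s, (n_modules, d_hid, d_key))
        self.W_rec = rng.normal(0, s, (n_modules, d_hid, d_hid))
        self.Wq_c = rng.normal(0, s, (n_modules, d_hid, d_key))
        self.Wk_c = rng.normal(0, s, (n_modules, d_hid, d_key))
        self.Wv_c = rng.normal(0, s, (n_modules, d_hid, d_hid))
        # Input keys and values are computed from the observation.
        self.Wk_in = rng.normal(0, s, (d_in, d_key))
        self.Wv_in = rng.normal(0, s, (d_in, d_hid))

    def step(self, h, x):
        """h: (n_modules, d_hid) hidden states, x: (d_in,) observation."""
        scale = np.sqrt(self.d_key)
        # 1. Input attention: each module attends over [real input, null input].
        rows = np.stack([x, np.zeros_like(x)])            # (2, d_in)
        q = np.einsum('nh,nhk->nk', h, self.Wq_in)        # (n, d_key)
        att = softmax(q @ (rows @ self.Wk_in).T / scale)  # (n, 2)
        # 2. Activate the top_k modules that attend most to the real input.
        active = np.argsort(-att[:, 0])[:self.k]
        mask = np.zeros(self.n, dtype=bool)
        mask[active] = True
        # 3. Independent dynamics: only active modules take a recurrent step.
        inp = att @ (rows @ self.Wv_in)                   # (n, d_hid)
        stepped = np.tanh(inp + np.einsum('nh,nhj->nj', h, self.W_rec))
        h_mid = np.where(mask[:, None], stepped, h)
        # 4. Communication: active modules read from all modules via attention.
        qc = np.einsum('nh,nhk->nk', h_mid, self.Wq_c)
        kc = np.einsum('nh,nhk->nk', h_mid, self.Wk_c)
        vc = np.einsum('nh,nhj->nj', h_mid, self.Wv_c)
        att_c = softmax(qc @ kc.T / scale)                # (n, n)
        h_comm = h_mid + att_c @ vc
        # 5. Inactive modules keep their previous state unchanged.
        return np.where(mask[:, None], h_comm, h), mask

model = ToyRIMs()
rng = np.random.default_rng(1)
h0 = rng.normal(size=(4, 5))
h1, active_mask = model.step(h0, rng.normal(size=3))
```

In this sketch, `active_mask` marks which modules were updated at the step, and rows of `h1` belonging to inactive modules are identical to the corresponding rows of `h0`.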

Results

Task	Dataset	Metric	Value	Model
Atari Games	Atari 2600 Beam Rider	Score	5320	RIMs-PPO
Atari Games	Atari 2600 Zaxxon	Score	15000	RIMs-PPO
Atari Games	Atari 2600 Up and Down	Score	390000	RIMs-PPO
Video Games	Atari 2600 Beam Rider	Score	5320	RIMs-PPO
Video Games	Atari 2600 Zaxxon	Score	15000	RIMs-PPO
Video Games	Atari 2600 Up and Down	Score	390000	RIMs-PPO
