TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/IMPALA: Scalable Distributed Deep-RL with Importance Weigh...

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

Lasse Espeholt, Hubert Soyer, Remi Munos, Karen Simonyan, Volodymir Mnih, Tom Ward, Yotam Doron, Vlad Firoiu, Tim Harley, Iain Dunning, Shane Legg, Koray Kavukcuoglu

2018-02-05ICML 2018 7Reinforcement LearningAtari Gamesreinforcement-learning
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCode(official)CodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

In this work we aim to solve a large collection of tasks using a single reinforcement learning agent with a single set of parameters. A key challenge is to handle the increased amount of data and extended training time. We have developed a new distributed agent IMPALA (Importance Weighted Actor-Learner Architecture) that not only uses resources more efficiently in single-machine training but also scales to thousands of machines without sacrificing data efficiency or resource utilisation. We achieve stable learning at high throughput by combining decoupled acting and learning with a novel off-policy correction method called V-trace. We demonstrate the effectiveness of IMPALA for multi-task reinforcement learning on DMLab-30 (a set of 30 tasks from the DeepMind Lab environment (Beattie et al., 2016)) and Atari-57 (all available Atari games in Arcade Learning Environment (Bellemare et al., 2013a)). Our results show that IMPALA is able to achieve better performance than previous agents with less data, and crucially exhibits positive transfer between tasks as a result of its multi-task approach.

Results

TaskDatasetMetricValueModel
Atari GamesAtari 2600 BoxingScore99.96IMPALA (deep)
Atari GamesAtari 2600 SkiingScore-10180.38IMPALA (deep)
Atari GamesAtari 2600 Double DunkScore-0.33IMPALA (deep)
Atari GamesAtari 2600 Ms. PacmanScore7342.32IMPALA (deep)
Atari GamesAtari 2600 CentipedeScore11049.75IMPALA (deep)
Atari GamesAtari 2600 TutankhamScore292.11IMPALA (deep)
Atari GamesAtari 2600 PongScore20.98IMPALA (deep)
Atari GamesAtari 2600 KrullScore8147.4IMPALA (deep)
Atari GamesAtari 2600 BreakoutScore787.34IMPALA (deep)
Atari GamesAtari 2600 FrostbiteScore317.75IMPALA (deep)
Atari GamesAtari 2600 Yars RevengeScore84231.14IMPALA (deep)
Atari GamesAtari 2600 GopherScore66782.3IMPALA (deep)
Atari GamesAtari 2600 Space InvadersScore43595.78IMPALA (deep)
Atari GamesAtari 2600 James BondScore601.5IMPALA (deep)
Atari GamesAtari 2600 AmidarScore1554.79IMPALA (deep)
Atari GamesAtari 2600 TennisScore0.55IMPALA (deep)
Atari GamesAtari 2600 Crazy ClimberScore136950IMPALA (deep)
Atari GamesAtari 2600 AsteroidsScore108590.05IMPALA (deep)
Atari GamesAtari 2600 GravitarScore359.5IMPALA (deep)
Atari GamesAtari 2600 Time PilotScore48481.5IMPALA (deep)
Atari GamesAtari 2600 Demon AttackScore132826.98IMPALA (deep)
Atari GamesAtari 2600 Battle ZoneScore20885IMPALA (deep)
Atari GamesAtari 2600 PhoenixScore210996.45IMPALA (deep)
Atari GamesAtari 2600 Beam RiderScore32463.47IMPALA (deep)
Atari GamesAtari 2600 AsterixScore300732IMPALA (deep)
Atari GamesAtari 2600 Kung-Fu MasterScore43375.5IMPALA (deep)
Atari GamesAtari 2600 BowlingScore59.92IMPALA (deep)
Atari GamesAtari 2600 KangarooScore1632IMPALA (deep)
Atari GamesAtari 2600 AssaultScore19148.47IMPALA (deep)
Atari GamesAtari 2600 AlienScore15962.1IMPALA (deep)
Atari GamesAtari 2600 Fishing DerbyScore44.85IMPALA (deep)
Atari GamesAtari 2600 Pitfall!Score-1.66IMPALA (deep)
Atari GamesAtari 2600 SeaquestScore1753.2IMPALA (deep)
Atari GamesAtari 2600 Chopper CommandScore28255IMPALA (deep)
Atari GamesAtari-57Human World Record Breakthrough3IMPALA, deep
Atari GamesAtari 2600 SolarisScore2365IMPALA (deep)
Atari GamesAtari 2600 SurroundScore7.56IMPALA (deep)
Atari GamesAtari 2600 Video PinballScore572898.27IMPALA (deep)
Atari GamesAtari 2600 Wizard of WorScore9157.5IMPALA (deep)
Atari GamesAtari 2600 ZaxxonScore32935.5IMPALA (deep)
Atari GamesAtari 2600 DefenderScore185203IMPALA (deep)
Atari GamesAtari 2600 RobotankScore12.96IMPALA (deep)
Atari GamesAtari 2600 Name This GameScore21537.2IMPALA (deep)
Atari GamesAtari 2600 Star GunnerScore200625IMPALA (deep)
Atari GamesAtari 2600 Ice HockeyScore3.48IMPALA (deep)
Atari GamesAtari 2600 BerzerkScore1852.7IMPALA (deep)
Atari GamesAtari 2600 AtlantisScore849967.5IMPALA (deep)
Atari GamesAtari 2600 HEROScore33730.55IMPALA (deep)
Atari GamesAtari 2600 Bank HeistScore1223.15IMPALA (deep)
Atari GamesAtari 2600 Private EyeScore98.5IMPALA (deep)
Atari GamesAtari 2600 Q*BertScore351200.12IMPALA (deep)
Atari GamesAtari 2600 River RaidScore29608.05IMPALA (deep)
Atari GamesAtari 2600 Road RunnerScore57121IMPALA (deep)
Atari GamesAtari 2600 Up and DownScore332546.75IMPALA (deep)
Video GamesAtari 2600 BoxingScore99.96IMPALA (deep)
Video GamesAtari 2600 SkiingScore-10180.38IMPALA (deep)
Video GamesAtari 2600 Double DunkScore-0.33IMPALA (deep)
Video GamesAtari 2600 Ms. PacmanScore7342.32IMPALA (deep)
Video GamesAtari 2600 CentipedeScore11049.75IMPALA (deep)
Video GamesAtari 2600 TutankhamScore292.11IMPALA (deep)
Video GamesAtari 2600 PongScore20.98IMPALA (deep)
Video GamesAtari 2600 KrullScore8147.4IMPALA (deep)
Video GamesAtari 2600 BreakoutScore787.34IMPALA (deep)
Video GamesAtari 2600 FrostbiteScore317.75IMPALA (deep)
Video GamesAtari 2600 Yars RevengeScore84231.14IMPALA (deep)
Video GamesAtari 2600 GopherScore66782.3IMPALA (deep)
Video GamesAtari 2600 Space InvadersScore43595.78IMPALA (deep)
Video GamesAtari 2600 James BondScore601.5IMPALA (deep)
Video GamesAtari 2600 AmidarScore1554.79IMPALA (deep)
Video GamesAtari 2600 TennisScore0.55IMPALA (deep)
Video GamesAtari 2600 Crazy ClimberScore136950IMPALA (deep)
Video GamesAtari 2600 AsteroidsScore108590.05IMPALA (deep)
Video GamesAtari 2600 GravitarScore359.5IMPALA (deep)
Video GamesAtari 2600 Time PilotScore48481.5IMPALA (deep)
Video GamesAtari 2600 Demon AttackScore132826.98IMPALA (deep)
Video GamesAtari 2600 Battle ZoneScore20885IMPALA (deep)
Video GamesAtari 2600 PhoenixScore210996.45IMPALA (deep)
Video GamesAtari 2600 Beam RiderScore32463.47IMPALA (deep)
Video GamesAtari 2600 AsterixScore300732IMPALA (deep)
Video GamesAtari 2600 Kung-Fu MasterScore43375.5IMPALA (deep)
Video GamesAtari 2600 BowlingScore59.92IMPALA (deep)
Video GamesAtari 2600 KangarooScore1632IMPALA (deep)
Video GamesAtari 2600 AssaultScore19148.47IMPALA (deep)
Video GamesAtari 2600 AlienScore15962.1IMPALA (deep)
Video GamesAtari 2600 Fishing DerbyScore44.85IMPALA (deep)
Video GamesAtari 2600 Pitfall!Score-1.66IMPALA (deep)
Video GamesAtari 2600 SeaquestScore1753.2IMPALA (deep)
Video GamesAtari 2600 Chopper CommandScore28255IMPALA (deep)
Video GamesAtari-57Human World Record Breakthrough3IMPALA, deep
Video GamesAtari 2600 SolarisScore2365IMPALA (deep)
Video GamesAtari 2600 SurroundScore7.56IMPALA (deep)
Video GamesAtari 2600 Video PinballScore572898.27IMPALA (deep)
Video GamesAtari 2600 Wizard of WorScore9157.5IMPALA (deep)
Video GamesAtari 2600 ZaxxonScore32935.5IMPALA (deep)
Video GamesAtari 2600 DefenderScore185203IMPALA (deep)
Video GamesAtari 2600 RobotankScore12.96IMPALA (deep)
Video GamesAtari 2600 Name This GameScore21537.2IMPALA (deep)
Video GamesAtari 2600 Star GunnerScore200625IMPALA (deep)
Video GamesAtari 2600 Ice HockeyScore3.48IMPALA (deep)
Video GamesAtari 2600 BerzerkScore1852.7IMPALA (deep)
Video GamesAtari 2600 AtlantisScore849967.5IMPALA (deep)
Video GamesAtari 2600 HEROScore33730.55IMPALA (deep)
Video GamesAtari 2600 Bank HeistScore1223.15IMPALA (deep)
Video GamesAtari 2600 Private EyeScore98.5IMPALA (deep)
Video GamesAtari 2600 Q*BertScore351200.12IMPALA (deep)
Video GamesAtari 2600 River RaidScore29608.05IMPALA (deep)
Video GamesAtari 2600 Road RunnerScore57121IMPALA (deep)
Video GamesAtari 2600 Up and DownScore332546.75IMPALA (deep)

Related Papers

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning2025-07-18VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback2025-07-17VAR-MATH: Probing True Mathematical Reasoning in Large Language Models via Symbolic Multi-Instance Benchmarks2025-07-17QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation2025-07-17Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities2025-07-17Autonomous Resource Management in Microservice Systems via Reinforcement Learning2025-07-17