TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Agent57: Outperforming the Atari Human Benchmark

Agent57: Outperforming the Atari Human Benchmark

Adrià Puigdomènech Badia, Bilal Piot, Steven Kapturowski, Pablo Sprechmann, Alex Vitvitskyi, Daniel Guo, Charles Blundell

2020-03-30ICML 2020 1Reinforcement LearningAtari Games
PaperPDFCodeCodeCodeCodeCode

Abstract

Atari games have been a long-standing benchmark in the reinforcement learning (RL) community for the past decade. This benchmark was proposed to test general competency of RL algorithms. Previous work has achieved good average performance by doing outstandingly well on many games of the set, but very poorly in several of the most challenging games. We propose Agent57, the first deep RL agent that outperforms the standard human benchmark on all 57 Atari games. To achieve this result, we train a neural network which parameterizes a family of policies ranging from very exploratory to purely exploitative. We propose an adaptive mechanism to choose which policy to prioritize throughout the training process. Additionally, we utilize a novel parameterization of the architecture that allows for more consistent and stable learning.

Results

TaskDatasetMetricValueModel
Atari Gamesatari gameHuman World Record Breakthrough18Agent57
Atari GamesAtari 2600 BoxingScore100Agent57
Atari GamesAtari 2600 SkiingScore-4202.6Agent57
Atari GamesAtari 2600 Double DunkScore23.93Agent57
Atari GamesAtari 2600 Ms. PacmanScore63994.44Agent57
Atari GamesAtari 2600 CentipedeScore412847.86Agent57
Atari GamesAtari 2600 TutankhamScore2354.91Agent57
Atari GamesAtari 2600 FreewayScore32.59Agent57
Atari GamesAtari 2600 PongScore20.67Agent57
Atari GamesAtari 2600 EnduroScore2367.71Agent57
Atari GamesAtari 2600 KrullScore251997.31Agent57
Atari GamesAtari 2600 BreakoutScore790.4Agent57
Atari GamesAtari 2600 FrostbiteScore541280.88Agent57
Atari GamesAtari 2600 Yars RevengeScore998532.37Agent57
Atari GamesAtari 2600 Montezuma's RevengeScore9352.01Agent57
Atari GamesAtari 2600 GopherScore117777.08Agent57
Atari GamesAtari 2600 Space InvadersScore48680.86Agent57
Atari GamesAtari 2600 James BondScore135784.96Agent57
Atari GamesAtari 2600 AmidarScore29660.08Agent57
Atari GamesAtari 2600 TennisScore23.84Agent57
Atari GamesAtari 2600 Crazy ClimberScore565909.85Agent57
Atari GamesAtari 2600 AsteroidsScore150854.61Agent57
Atari GamesAtari 2600 GravitarScore19213.96Agent57
Atari GamesAtari 2600 Time PilotScore405425.31Agent57
Atari GamesAtari 2600 Demon AttackScore143161.44Agent57
Atari GamesAtari 2600 Battle ZoneScore934134.88Agent57
Atari GamesAtari 2600 PhoenixScore908264.15Agent57
Atari GamesAtari 2600 Beam RiderScore300509.8Agent57
Atari GamesAtari 2600 AsterixScore991384.42Agent57
Atari GamesAtari 2600 Kung-Fu MasterScore206845.82Agent57
Atari GamesAtari 2600 BowlingScore251.18Agent57
Atari GamesAtari 2600 KangarooScore24034.16Agent57
Atari GamesAtari 2600 AssaultScore67212.67Agent57
Atari GamesAtari 2600 AlienScore297638.17Agent57
Atari GamesAtari 2600 Fishing DerbyScore86.97Agent57
Atari GamesAtari 2600 Pitfall!Score18756.01Agent57
Atari GamesAtari 2600 SeaquestScore999997.63Agent57
Atari GamesAtari 2600 Chopper CommandScore999900Agent57
Atari GamesAtari 2600 SolarisScore44199.93Agent57
Atari GamesAtari 2600 SurroundScore9.5Agent57
Atari GamesAtari 2600 Video PinballScore992340.74Agent57
Atari GamesAtari 2600 Wizard of WorScore157306.41Agent57
Atari GamesAtari 2600 ZaxxonScore249808.9Agent57
Atari GamesAtari 2600 DefenderScore677642.78Agent57
Atari GamesAtari 2600 RobotankScore127.32Agent57
Atari GamesAtari 2600 Name This GameScore54386.77Agent57
Atari GamesAtari 2600 Star GunnerScore839573.53Agent57
Atari GamesAtari 2600 Ice HockeyScore63.64Agent57
Atari GamesAtari 2600 BerzerkScore61507.83Agent57
Atari GamesAtari 2600 AtlantisScore1528841.76Agent57
Atari GamesAtari 2600 HEROScore114736.26Agent57
Atari GamesAtari 2600 Bank HeistScore23071.5Agent57
Atari GamesAtari 2600 VentureScore2623.71Agent57
Atari GamesAtari 2600 Private EyeScore79716.46Agent57
Atari GamesAtari 2600 Q*BertScore580328.14Agent57
Atari GamesAtari 2600 River RaidScore63318.67Agent57
Atari GamesAtari 2600 Road RunnerScore243025.8Agent57
Atari GamesAtari 2600 Up and DownScore623805.73Agent57
Video Gamesatari gameHuman World Record Breakthrough18Agent57
Video GamesAtari 2600 BoxingScore100Agent57
Video GamesAtari 2600 SkiingScore-4202.6Agent57
Video GamesAtari 2600 Double DunkScore23.93Agent57
Video GamesAtari 2600 Ms. PacmanScore63994.44Agent57
Video GamesAtari 2600 CentipedeScore412847.86Agent57
Video GamesAtari 2600 TutankhamScore2354.91Agent57
Video GamesAtari 2600 FreewayScore32.59Agent57
Video GamesAtari 2600 PongScore20.67Agent57
Video GamesAtari 2600 EnduroScore2367.71Agent57
Video GamesAtari 2600 KrullScore251997.31Agent57
Video GamesAtari 2600 BreakoutScore790.4Agent57
Video GamesAtari 2600 FrostbiteScore541280.88Agent57
Video GamesAtari 2600 Yars RevengeScore998532.37Agent57
Video GamesAtari 2600 Montezuma's RevengeScore9352.01Agent57
Video GamesAtari 2600 GopherScore117777.08Agent57
Video GamesAtari 2600 Space InvadersScore48680.86Agent57
Video GamesAtari 2600 James BondScore135784.96Agent57
Video GamesAtari 2600 AmidarScore29660.08Agent57
Video GamesAtari 2600 TennisScore23.84Agent57
Video GamesAtari 2600 Crazy ClimberScore565909.85Agent57
Video GamesAtari 2600 AsteroidsScore150854.61Agent57
Video GamesAtari 2600 GravitarScore19213.96Agent57
Video GamesAtari 2600 Time PilotScore405425.31Agent57
Video GamesAtari 2600 Demon AttackScore143161.44Agent57
Video GamesAtari 2600 Battle ZoneScore934134.88Agent57
Video GamesAtari 2600 PhoenixScore908264.15Agent57
Video GamesAtari 2600 Beam RiderScore300509.8Agent57
Video GamesAtari 2600 AsterixScore991384.42Agent57
Video GamesAtari 2600 Kung-Fu MasterScore206845.82Agent57
Video GamesAtari 2600 BowlingScore251.18Agent57
Video GamesAtari 2600 KangarooScore24034.16Agent57
Video GamesAtari 2600 AssaultScore67212.67Agent57
Video GamesAtari 2600 AlienScore297638.17Agent57
Video GamesAtari 2600 Fishing DerbyScore86.97Agent57
Video GamesAtari 2600 Pitfall!Score18756.01Agent57
Video GamesAtari 2600 SeaquestScore999997.63Agent57
Video GamesAtari 2600 Chopper CommandScore999900Agent57
Video GamesAtari 2600 SolarisScore44199.93Agent57
Video GamesAtari 2600 SurroundScore9.5Agent57
Video GamesAtari 2600 Video PinballScore992340.74Agent57
Video GamesAtari 2600 Wizard of WorScore157306.41Agent57
Video GamesAtari 2600 ZaxxonScore249808.9Agent57
Video GamesAtari 2600 DefenderScore677642.78Agent57
Video GamesAtari 2600 RobotankScore127.32Agent57
Video GamesAtari 2600 Name This GameScore54386.77Agent57
Video GamesAtari 2600 Star GunnerScore839573.53Agent57
Video GamesAtari 2600 Ice HockeyScore63.64Agent57
Video GamesAtari 2600 BerzerkScore61507.83Agent57
Video GamesAtari 2600 AtlantisScore1528841.76Agent57
Video GamesAtari 2600 HEROScore114736.26Agent57
Video GamesAtari 2600 Bank HeistScore23071.5Agent57
Video GamesAtari 2600 VentureScore2623.71Agent57
Video GamesAtari 2600 Private EyeScore79716.46Agent57
Video GamesAtari 2600 Q*BertScore580328.14Agent57
Video GamesAtari 2600 River RaidScore63318.67Agent57
Video GamesAtari 2600 Road RunnerScore243025.8Agent57
Video GamesAtari 2600 Up and DownScore623805.73Agent57

Related Papers

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning2025-07-18VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback2025-07-17VAR-MATH: Probing True Mathematical Reasoning in Large Language Models via Symbolic Multi-Instance Benchmarks2025-07-17QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation2025-07-17Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities2025-07-17Autonomous Resource Management in Microservice Systems via Reinforcement Learning2025-07-17