TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Playing Games/Atari Games/Atari 2600 Frostbite

Atari Games on Atari 2600 Frostbite

Metric: Score (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Score▼Extra DataPaperDate↕Code
1MuZero631378.53NoMastering Atari, Go, Chess and Shogi by Planning...2019-11-19Code
2Agent57541280.88NoAgent57: Outperforming the Atari Human Benchmark2020-03-30Code
3MuZero (Res2 Adam)374769.76NoOnline and Offline Reinforcement Learning by Pla...2021-04-13Code
4R2D2315456.4No--Code
5Fearlessmrx214060NoFully Parameterized Quantile Function for Distri...2019-11-05Code
6DreamerV211384NoMastering Atari with Discrete World Models2020-10-05Code
7GDI-H3(200M frames)11330NoGeneralized Data Distribution Iteration2022-06-07-
8GDI-H311330NoGeneralized Data Distribution Iteration2022-06-07-
9GDI-I310485NoGDI: Rethinking What Makes Reinforcement Learnin...2021-06-11-
10GDI-I310485NoGDI: Rethinking What Makes Reinforcement Learnin...2021-06-11-
11Ape-X9328.6NoDistributed Prioritized Experience Replay2018-03-02Code
12ASL DDQN8616.4NoTrain a Real-world Local Path Planner in One Hou...2023-05-07Code
13Prior+Duel noop7413NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
14A2C + SIL6289.8NoSelf-Imitation Learning2018-06-14Code
15TRPO-hash5214No#Exploration: A Study of Count-Based Exploration...2016-11-15Code
16Duel noop4672.8NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
17QR-DQN-14384NoDistributional Reinforcement Learning with Quant...2017-10-27Code
18Prior noop4380.1NoPrioritized Experience Replay2015-11-18Code
19IQN4324NoImplicit Quantile Networks for Distributional Re...2018-06-14Code
20Prior+Duel hs4038.4NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
21C51 noop3965NoA Distributional Perspective on Reinforcement Le...2017-07-21Code
22VPN3811NoValue Prediction Network2017-07-11Code
23Prior hs3510NoPrioritized Experience Replay2015-11-18Code
24DDQN+Pop-Art noop3469.6NoLearning values across many orders of magnitude2016-02-24-
25Persistent AL3248.96NoIncreasing the Action Gap: New Operators for Rei...2015-12-15Code
26NoisyNet-Dueling2923NoNoisy Networks for Exploration2017-06-30Code
27Sarsa-φ-EB2770.1NoCount-Based Exploration in Feature Space for Rei...2017-06-25Code
28MFEC2394NoModel-Free Episodic Control with State Aggregation2020-08-21-
29Duel hs2332.4NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
30Advantage Learning2305.82NoIncreasing the Action Gap: New Operators for Rei...2015-12-15Code
31Bootstrapped DQN2181.4NoDeep Exploration via Bootstrapped DQN2016-02-15Code
32DDQN (tuned) noop1683.3NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
33DDQN (tuned) hs1448.1NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
34Sarsa-ε1394.3NoCount-Based Exploration in Feature Space for Rei...2017-06-25Code
35CURL924NoCURL: Contrastive Unsupervised Representations f...2020-04-08Code
36DQN noop797.4NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
37CGP782NoEvolving simple programs for playing Atari games2018-06-14Code
38MP-EB507NoIncentivizing Exploration In Reinforcement Learn...2015-07-03Code
39DQN hs496.1NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
40Gorila426.6NoMassively Parallel Methods for Deep Reinforcemen...2015-07-15Code
41ES FF (1 hour) noop370NoEvolution Strategies as a Scalable Alternative t...2017-03-10Code
42Nature DQN328.3No--Code
43DNA320NoDNA: Proximal Policy Optimization with a Dual Ne...2022-06-20Code
44IMPALA (deep)317.75NoIMPALA: Scalable Distributed Deep-RL with Import...2018-02-05Code
45POP3D316.87NoPolicy Optimization With Penalized Point Probabi...2018-07-02Code
46IDVQ + DRSC + XNES300NoPlaying Atari with Six Neurons2018-06-04Code
47UCT270.5NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
48Best Learner216.9NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
49A3C LSTM hs197.6NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
50A3C FF hs190.5NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
51SARSA180.9No---
52A3C FF (1 day) hs180.1NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
53SAC59.4NoSoft Actor-Critic for Discrete Action Settings2019-10-16Code