TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Playing Games/Atari Games/Atari 2600 Demon Attack

Atari Games on Atari 2600 Demon Attack

Metric: Score (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Score▼Extra DataPaperDate↕Code
1GDI-H3787985NoGeneralized Data Distribution Iteration2022-06-07-
2GDI-I3675530NoGeneralized Data Distribution Iteration2022-06-07-
3GDI-I3675530NoGeneralized Data Distribution Iteration2022-06-07-
4RIMs-PPO230324No---
5MuZero143964.26NoMastering Atari, Go, Chess and Shogi by Planning...2019-11-19Code
6MuZero (Res2 Adam)143838.04NoOnline and Offline Reinforcement Learning by Pla...2021-04-13Code
7Agent57143161.44NoAgent57: Outperforming the Atari Human Benchmark2020-03-30Code
8R2D2140002.3No--Code
9Ape-X133086.4NoDistributed Prioritized Experience Replay2018-03-02Code
10IMPALA (deep)132826.98NoIMPALA: Scalable Distributed Deep-RL with Import...2018-02-05Code
11C51 noop130955NoA Distributional Perspective on Reinforcement Le...2017-07-21Code
12IQN128580NoImplicit Quantile Networks for Distributional Re...2018-06-14Code
13QR-DQN-1121551NoDistributional Reinforcement Learning with Quant...2017-10-27Code
14ASL DDQN119773.9NoTrain a Real-world Local Path Planner in One Hou...2023-05-07Code
15A3C LSTM hs115201.9NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
16Reactor 500M115154NoThe Reactor: A fast and sample-efficient Actor-C...2017-04-15-
17A3C FF hs113308.4NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
18DNA97909NoDNA: Proximal Policy Optimization with a Dual Ne...2022-06-20Code
19A3C FF (1 day) hs84997.5NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
20Bootstrapped DQN82610NoDeep Exploration via Bootstrapped DQN2016-02-15Code
21DreamerV282263NoMastering Atari with Discrete World Models2020-10-05Code
22Prior+Duel hs73371.3NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
23Prior+Duel noop72878.6NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
24Prior noop71846.4NoPrioritized Experience Replay2015-11-18Code
25Persistent AL70908.17NoIncreasing the Action Gap: New Operators for Rei...2015-12-15Code
26DDQN (tuned) hs69803.4NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
27NoisyNet-Dueling69311NoNoisy Networks for Exploration2017-06-30Code
28DDQN+Pop-Art noop63644.9NoLearning values across many orders of magnitude2016-02-24-
29Prior hs61277.5NoPrioritized Experience Replay2015-11-18Code
30POP3D61147.33NoPolicy Optimization With Penalized Point Probabi...2018-07-02Code
31Duel noop60813.3NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
32DDQN (tuned) noop58044.2NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
33Duel hs56322.8NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
34UCT28158.8NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
35Advantage Learning27153.48NoIncreasing the Action Gap: New Operators for Rei...2015-12-15Code
36Gorila14880.1NoMassively Parallel Methods for Deep Reinforcemen...2015-07-15Code
37DQN hs12550.7NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
38DQN noop12149.4NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
39A2C + SIL10140.5NoSelf-Imitation Learning2018-06-14Code
40Nature DQN9711No--Code
41CGP2387NoEvolving simple programs for playing Atari games2018-06-14Code
42ES FF (1 hour) noop1166.5NoEvolution Strategies as a Scalable Alternative t...2017-03-10Code
43CURL834NoCURL: Contrastive Unsupervised Representations f...2020-04-08Code
44Best Learner520.5NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
45IDVQ + DRSC + XNES325NoPlaying Atari with Six Neurons2018-06-04Code
46SARSA0No---