TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Playing Games/Atari Games/Atari 2600 Breakout

Atari Games on Atari 2600 Breakout

Metric: Score (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Score▼Extra DataPaperDate↕Code
1GDI-H3(200M frames)864NoGeneralized Data Distribution Iteration2022-06-07-
2GDI-I3(200M frames)864NoGeneralized Data Distribution Iteration2022-06-07-
3GDI-I3864NoGeneralized Data Distribution Iteration2022-06-07-
4GDI-H3864NoGeneralized Data Distribution Iteration2022-06-07-
5Bootstrapped DQN855NoDeep Exploration via Bootstrapped DQN2016-02-15Code
6FQF854.2NoFully Parameterized Quantile Function for Distri...2019-11-05Code
7R2D2837.7No--Code
8Ape-X800.9NoDistributed Prioritized Experience Replay2018-03-02Code
9Agent57790.4NoAgent57: Outperforming the Atari Human Benchmark2020-03-30Code
10IMPALA (deep)787.34NoIMPALA: Scalable Distributed Deep-RL with Import...2018-02-05Code
11A3C LSTM hs766.8NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
12MuZero (Res2 Adam)758.04NoOnline and Offline Reinforcement Learning by Pla...2021-04-13Code
13C51 noop748NoA Distributional Perspective on Reinforcement Le...2017-07-21Code
14QR-DQN-1742NoDistributional Reinforcement Learning with Quant...2017-10-27Code
15IQN734NoImplicit Quantile Networks for Distributional Re...2018-06-14Code
16A3C FF hs681.9NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
17DNA626NoDNA: Proximal Policy Optimization with a Dual Ne...2022-06-20Code
18ASL DDQN621.7NoTrain a Real-world Local Path Planner in One Hou...2023-05-07Code
19A3C FF (1 day) hs551.6NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
20Reactor 500M514.8NoThe Reactor: A fast and sample-efficient Actor-C...2017-04-15-
21POP3D458.41NoPolicy Optimization With Penalized Point Probabi...2018-07-02Code
22A2C + SIL452NoSelf-Imitation Learning2018-06-14Code
23Persistent AL431.89NoIncreasing the Action Gap: New Operators for Rei...2015-12-15Code
24Advantage Learning425.32NoIncreasing the Action Gap: New Operators for Rei...2015-12-15Code
25DDQN (tuned) noop418.5NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
26Duel hs411.6NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
27Nature DQN401.2No--Code
28DQN noop385.5NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
29Prior noop373.9NoPrioritized Experience Replay2015-11-18Code
30MAC372.7NoMean Actor Critic2017-09-01Code
31DDQN (tuned) hs368.9NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
32Prior+Duel noop366NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
33UCT364.4NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
34Prior+Duel hs354.6NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
35DQN hs354.5NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
36DDRL A3C350NoDistributed Deep Reinforcement Learning: Learn h...2018-01-09Code
37Duel noop345.3NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
38DDQN+Pop-Art noop344.1NoLearning values across many orders of magnitude2016-02-24-
39Prior hs343NoPrioritized Experience Replay2015-11-18Code
40Recurrent Rational DQN Average336NoAdaptive Rational Activations to Boost Deep Rein...2021-02-18Code
41Rational DQN Average316NoAdaptive Rational Activations to Boost Deep Rein...2021-02-18Code
42Gorila313NoMassively Parallel Methods for Deep Reinforcemen...2015-07-15Code
43DreamerV2312NoMastering Atari with Discrete World Models2020-10-05Code
44DT267.5NoDecision Transformer: Reinforcement Learning via...2021-06-02Code
45NoisyNet-Dueling263NoNoisy Networks for Exploration2017-06-30Code
46DQN Best225NoPlaying Atari with Deep Reinforcement Learning2013-12-19Code
47SPOS180.6NoOptimizing the Neural Architecture of Reinforcem...2020-11-30Code
48ENAS Search space 1161.1NoOptimizing the Neural Architecture of Reinforcem...2020-11-30Code
49SPOS Search space 1144.4NoOptimizing the Neural Architecture of Reinforcem...2020-11-30Code
50ENAS91.4NoOptimizing the Neural Architecture of Reinforcem...2020-11-30Code
51DARQN hard20NoDeep Attention Recurrent Q-Network2015-12-05Code
52CURL18.2NoCURL: Contrastive Unsupervised Representations f...2020-04-08Code
53CGP13.2NoEvolving simple programs for playing Atari games2018-06-14Code
54Discrete Latent Space World Model (VQ-VAE)11.6NoSmaller World Models for Reinforcement Learning2020-10-12-
55ES FF (1 hour) noop9.5NoEvolution Strategies as a Scalable Alternative t...2017-03-10Code
56SARSA6.1No---
57Best Learner5.2NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
58SAC0.7NoSoft Actor-Critic for Discrete Action Settings2019-10-16Code