TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Playing Games/Atari Games/Atari 2600 Zaxxon

Atari Games on Atari 2600 Zaxxon

Metric: Score (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Score▼Extra DataPaperDate↕Code
1MuZero725853.9NoMastering Atari, Go, Chess and Shogi by Planning...2019-11-19Code
2Agent57249808.9NoAgent57: Outperforming the Atari Human Benchmark2020-03-30Code
3R2D2224910.7No--Code
4GDI-H3216020NoGeneralized Data Distribution Iteration2022-06-07-
5MuZero (Res2 Adam)154131.86NoOnline and Offline Reinforcement Learning by Pla...2021-04-13Code
6GDI-I3109140NoGeneralized Data Distribution Iteration2022-06-07-
7DreamerV250699NoMastering Atari with Discrete World Models2020-10-05Code
8Ape-X42285.5NoDistributed Prioritized Experience Replay2018-03-02Code
9IMPALA (deep)32935.5NoIMPALA: Scalable Distributed Deep-RL with Import...2018-02-05Code
10A3C FF hs24622NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
11A3C LSTM hs23519NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
12UCT22610NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
13DNA22588NoDNA: Proximal Policy Optimization with a Dual Ne...2022-06-20Code
14IQN21772NoImplicit Quantile Networks for Distributional Re...2018-06-14Code
15ASL DDQN16420NoTrain a Real-world Local Path Planner in One Hou...2023-05-07Code
16RIMs-PPO15000NoRecurrent Independent Mechanisms2019-09-24Code
17NoisyNet-Dueling14874NoNoisy Networks for Exploration2017-06-30Code
18DDQN+Pop-Art noop14402NoLearning values across many orders of magnitude2016-02-24-
19Prior+Duel noop13886NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
20QR-DQN-113112NoDistributional Reinforcement Learning with Quant...2017-10-27Code
21Duel noop12944NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
22Bootstrapped DQN11491.7NoDeep Exploration via Bootstrapped DQN2016-02-15Code
23Prior+Duel hs11320NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
24C51 noop10513NoA Distributional Perspective on Reinforcement Le...2017-07-21Code
25Prior noop10469NoPrioritized Experience Replay2015-11-18Code
26Duel hs10164NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
27DDQN (tuned) noop10163NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
28Prior hs9474NoPrioritized Experience Replay2015-11-18Code
29POP3D9472NoPolicy Optimization With Penalized Point Probabi...2018-07-02Code
30A2C + SIL9164.2NoSelf-Imitation Learning2018-06-14Code
31Advantage Learning9129.61NoIncreasing the Action Gap: New Operators for Rei...2015-12-15Code
32DDQN (tuned) hs8593NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
33ES FF (1 hour) noop6380NoEvolution Strategies as a Scalable Alternative t...2017-03-10Code
34Gorila6159.4NoMassively Parallel Methods for Deep Reinforcemen...2015-07-15Code
35DQN noop5363NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
36Nature DQN4977No--Code
37DQN hs4412NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
38Best Learner3365.1NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
39CGP2980NoEvolving simple programs for playing Atari games2018-06-14Code
40A3C FF (1 day) hs2659NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
41SARSA21.4No---