Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Atari Games on Atari 2600 Alien

Metric: Score (higher is better)


Results

| # | Model | Score | Extra Data | Paper | Date | Code |
|---|-------|-------|------------|-------|------|------|
| 1 | MuZero | 741812.63 | No | Mastering Atari, Go, Chess and Shogi by Planning... | 2019-11-19 | Code |
| 2 | Agent57 | 297638.17 | No | Agent57: Outperforming the Atari Human Benchmark | 2020-03-30 | Code |
| 3 | GDI-H3(1B frames) | 279700 | No | - | - | - |
| 4 | R2D2 | 229496.9 | No | - | - | Code |
| 5 | MuZero (Res2 Adam) | 70192.35 | No | Online and Offline Reinforcement Learning by Pla... | 2021-04-13 | Code |
| 6 | GDI-H3 | 48735 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 7 | GDI-I3 | 43384 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 8 | Ape-X | 40804.9 | No | Distributed Prioritized Experience Replay | 2018-03-02 | Code |
| 9 | FQF | 16754.6 | No | Fully Parameterized Quantile Function for Distri... | 2019-11-05 | Code |
| 10 | IMPALA (deep) | 15962.1 | No | IMPALA: Scalable Distributed Deep-RL with Import... | 2018-02-05 | Code |
| 11 | Reactor 500M | 12689.1 | No | The Reactor: A fast and sample-efficient Actor-C... | 2017-04-15 | - |
| 12 | UCT | 7785 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 13 | IQN | 7022 | No | Implicit Quantile Networks for Distributional Re... | 2018-06-14 | Code |
| 14 | ASL DDQN | 6955.2 | No | Train a Real-world Local Path Planner in One Hou... | 2023-05-07 | Code |
| 15 | NoisyNet-Dueling | 5778 | No | Noisy Networks for Exploration | 2017-06-30 | Code |
| 16 | Persistent AL | 5699.81 | No | Increasing the Action Gap: New Operators for Rei... | 2015-12-15 | Code |
| 17 | DNA | 5021 | No | DNA: Proximal Policy Optimization with a Dual Ne... | 2022-06-20 | Code |
| 18 | Advantage Learning | 4990.91 | No | Increasing the Action Gap: New Operators for Rei... | 2015-12-15 | Code |
| 19 | QR-DQN-1 | 4871 | No | Distributional Reinforcement Learning with Quant... | 2017-10-27 | Code |
| 20 | Duel noop | 4461.4 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 21 | Prior noop | 4203.8 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 22 | DreamerV2 | 3967 | No | Mastering Atari with Discrete World Models | 2020-10-05 | Code |
| 23 | Prior+Duel noop | 3941 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 24 | DDQN (tuned) noop | 3747.7 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 25 | DDQN+Pop-Art noop | 3213.5 | No | Learning values across many orders of magnitude | 2016-02-24 | - |
| 26 | C51 noop | 3166 | No | A Distributional Perspective on Reinforcement Le... | 2017-07-21 | Code |
| 27 | Nature DQN | 3069 | No | - | - | Code |
| 28 | Bootstrapped DQN | 2436.6 | No | Deep Exploration via Bootstrapped DQN | 2016-02-15 | Code |
| 29 | A2C + SIL | 2242.2 | No | Self-Imitation Learning | 2018-06-14 | Code |
| 30 | CGP | 1978 | No | Evolving simple programs for playing Atari games | 2018-06-14 | Code |
| 31 | DQN noop | 1620 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 32 | POP3D | 1510.8 | No | Policy Optimization With Penalized Point Probabi... | 2018-07-02 | Code |
| 33 | Duel hs | 1486.5 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 34 | VPN | 1429 | No | Value Prediction Network | 2017-07-11 | Code |
| 35 | Prior hs | 1334.7 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 36 | Rainbow+SEER | 1172.6 | No | Improving Computational Efficiency in Visual Rei... | 2021-03-04 | Code |
| 37 | CURL | 1148.2 | No | CURL: Contrastive Unsupervised Representations f... | 2020-04-08 | Code |
| 38 | DDQN (tuned) hs | 1033.4 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 39 | ES FF (1 hour) noop | 994 | No | Evolution Strategies as a Scalable Alternative t... | 2017-03-10 | Code |
| 40 | A3C LSTM hs | 945.3 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 41 | Best Learner | 939.2 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 42 | Prior+Duel hs | 823.7 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 43 | Prior+Duel hs | 823.7 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 44 | Gorila | 813.5 | No | Massively Parallel Methods for Deep Reinforcemen... | 2015-07-15 | Code |
| 45 | DQN hs | 634 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 46 | A3C FF hs | 518.4 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 47 | SAC | 216.9 | No | Soft Actor-Critic for Discrete Action Settings | 2019-10-16 | Code |
| 48 | A3C FF (1 day) hs | 182.1 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 49 | SARSA | 103.2 | No | - | - | - |