Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Video Games on Atari 2600 Asterix

Metric: Score (higher is better)

Results

| # | Model | Score | Extra Data | Paper | Date | Code |
|---|-------|-------|------------|-------|------|------|
| 1 | GDI-H3 | 999999 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 2 | R2D2 | 999153.3 | No | - | - | Code |
| 3 | MuZero | 998425 | No | Mastering Atari, Go, Chess and Shogi by Planning... | 2019-11-19 | Code |
| 4 | Agent57 | 991384.42 | No | Agent57: Outperforming the Atari Human Benchmark | 2020-03-30 | Code |
| 5 | MuZero (Res2 Adam) | 862406.65 | No | Online and Offline Reinforcement Learning by Pla... | 2021-04-13 | Code |
| 6 | GDI-I3 | 759910 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 7 | FQF | 578388.5 | No | Fully Parameterized Quantile Function for Distri... | 2019-11-05 | Code |
| 8 | ASL DDQN | 567640 | No | Train a Real-world Local Path Planner in One Hou... | 2023-05-07 | Code |
| 9 | C51 noop | 406211 | No | A Distributional Perspective on Reinforcement Le... | 2017-07-21 | Code |
| 10 | Prior+Duel noop | 375080 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 11 | Prior+Duel hs | 364200 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 12 | Prior+Duel hs | 364200 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 13 | IQN | 342016 | No | Implicit Quantile Networks for Distributional Re... | 2018-06-14 | Code |
| 14 | DNA | 323965 | No | DNA: Proximal Policy Optimization with a Dual Ne... | 2022-06-20 | Code |
| 15 | Ape-X | 313305 | No | Distributed Prioritized Experience Replay | 2018-03-02 | Code |
| 16 | IMPALA (deep) | 300732 | No | IMPALA: Scalable Distributed Deep-RL with Import... | 2018-02-05 | Code |
| 17 | UCT | 290700 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 18 | QR-DQN-1 | 261025 | No | Distributional Reinforcement Learning with Quant... | 2017-10-27 | Code |
| 19 | Reactor 500M | 205914 | No | The Reactor: A fast and sample-efficient Actor-C... | 2017-04-15 | - |
| 20 | DreamerV2 | 72311 | No | Mastering Atari with Discrete World Models | 2020-10-05 | Code |
| 21 | Prior noop | 31527 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 22 | NoisyNet-Dueling | 28350 | No | Noisy Networks for Exploration | 2017-06-30 | Code |
| 23 | Duel noop | 28188 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 24 | Prior hs | 22484.5 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 25 | A3C FF hs | 22140.5 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 26 | RIMs-PPO | 21040 | No | - | - | - |
| 27 | Bootstrapped DQN | 19713.2 | No | Deep Exploration via Bootstrapped DQN | 2016-02-15 | Code |
| 28 | Persistent AL | 19564.9 | No | Increasing the Action Gap: New Operators for Rei... | 2015-12-15 | Code |
| 29 | DDQN+Pop-Art noop | 18919.5 | No | Learning values across many orders of magnitude | 2016-02-24 | - |
| 30 | Rational DQN Average | 18109 | No | Adaptive Rational Activations to Boost Deep Rein... | 2021-02-18 | Code |
| 31 | A2C + SIL | 17984.2 | No | Self-Imitation Learning | 2018-06-14 | Code |
| 32 | DDQN (tuned) noop | 17356.5 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 33 | A3C LSTM hs | 17244.5 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 34 | DDQN (tuned) hs | 16837 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 35 | Duel hs | 15840 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 36 | Advantage Learning | 12852.08 | No | Increasing the Action Gap: New Operators for Rei... | 2015-12-15 | Code |
| 37 | Recurrent Rational DQN Average | 12621 | No | Adaptive Rational Activations to Boost Deep Rein... | 2021-02-18 | Code |
| 38 | A3C FF (1 day) hs | 6723 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 39 | Nature DQN | 6012 | No | - | - | Code |
| 40 | DQN noop | 4359 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 41 | POP3D | 4310.67 | No | Policy Optimization With Penalized Point Probabi... | 2018-07-02 | Code |
| 42 | Gorila | 3324.7 | No | Massively Parallel Methods for Deep Reinforcemen... | 2015-07-15 | Code |
| 43 | DQN hs | 3170.5 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 44 | CGP | 1880 | No | Evolving simple programs for playing Atari games | 2018-06-14 | Code |
| 45 | ES FF (1 hour) noop | 1440 | No | Evolution Strategies as a Scalable Alternative t... | 2017-03-10 | Code |
| 46 | SARSA | 1332 | No | - | - | - |
| 47 | Best Learner | 987.3 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 48 | CURL | 524.3 | No | CURL: Contrastive Unsupervised Representations f... | 2020-04-08 | Code |
| 49 | SAC | 272 | No | Soft Actor-Critic for Discrete Action Settings | 2019-10-16 | Code |