TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Playing Games/Atari Games/Atari 2600 Asteroids

Atari Games on Atari 2600 Asteroids

Metric: Score (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Score▼Extra DataPaperDate↕Code
1GDI-H3760005NoGeneralized Data Distribution Iteration2022-06-07-
2GDI-I3751970NoGeneralized Data Distribution Iteration2022-06-07-
3MuZero678558.64NoMastering Atari, Go, Chess and Shogi by Planning...2019-11-19Code
4MuZero (Res2 Adam)476412NoOnline and Offline Reinforcement Learning by Pla...2021-04-13Code
5R2D2357867.7No--Code
6DNA165973NoDNA: Proximal Policy Optimization with a Dual Ne...2022-06-20Code
7Ape-X155495.1NoDistributed Prioritized Experience Replay2018-03-02Code
8Agent57150854.61NoAgent57: Outperforming the Atari Human Benchmark2020-03-30Code
9IMPALA (deep)108590.05NoIMPALA: Scalable Distributed Deep-RL with Import...2018-02-05Code
10NoisyNet-Dueling86700NoNoisy Networks for Exploration2017-06-30Code
11DreamerV241526NoMastering Atari with Discrete World Models2020-10-05Code
12CGP9412NoEvolving simple programs for playing Atari games2018-06-14Code
13A3C LSTM hs5093.1NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
14UCT4660.6NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
15FQF4553NoFully Parameterized Quantile Function for Distri...2019-11-05Code
16A3C FF hs4474.5NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
17QR-DQN-14226NoDistributional Reinforcement Learning with Quant...2017-10-27Code
18Reactor 500M3726.1NoThe Reactor: A fast and sample-efficient Actor-C...2017-04-15-
19A3C FF (1 day) hs3009.4NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
20IQN2898NoImplicit Quantile Networks for Distributional Re...2018-06-14Code
21DDQN+Pop-Art noop2869.3NoLearning values across many orders of magnitude2016-02-24-
22Duel noop2837.7NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
23Prior noop2654.3NoPrioritized Experience Replay2015-11-18Code
24POP3D2488.1NoPolicy Optimization With Penalized Point Probabi...2018-07-02Code
25A2C + SIL2259.4NoSelf-Imitation Learning2018-06-14Code
26Duel hs2035.4NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
27ASL DDQN1984.5NoTrain a Real-world Local Path Planner in One Hou...2023-05-07Code
28Advantage Learning1924.42NoIncreasing the Action Gap: New Operators for Rei...2015-12-15Code
29Prior hs1745.1NoPrioritized Experience Replay2015-11-18Code
30Persistent AL1673.52NoIncreasing the Action Gap: New Operators for Rei...2015-12-15Code
31Nature DQN1629No--Code
32ES FF (1 hour) noop1562NoEvolution Strategies as a Scalable Alternative t...2017-03-10Code
33C51 noop1516NoA Distributional Perspective on Reinforcement Le...2017-07-21Code
34DQN hs1458.7NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
35DQN noop1364.5NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
36DDQN (tuned) hs1193.2NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
37Prior+Duel noop1192.7NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
38Bootstrapped DQN1032NoDeep Exploration via Bootstrapped DQN2016-02-15Code
39Prior+Duel hs1021.9NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
40Gorila933.6NoMassively Parallel Methods for Deep Reinforcemen...2015-07-15Code
41Best Learner907.3NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
42DDQN (tuned) noop734.7NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
43SARSA89No---