Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Atari Games on Atari 2600 Beam Rider

Metric: Score (higher is better)

Results

| # | Model | Score | Extra Data | Paper | Date | Code |
|---|-------|-------|------------|-------|------|------|
| 1 | MuZero | 454993.53 | No | Mastering Atari, Go, Chess and Shogi by Planning... | 2019-11-19 | Code |
| 2 | GDI-H3 | 422890 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 3 | MuZero (Res2 Adam) | 333077.44 | No | Online and Offline Reinforcement Learning by Pla... | 2021-04-13 | Code |
| 4 | Agent57 | 300509.8 | No | Agent57: Outperforming the Atari Human Benchmark | 2020-03-30 | Code |
| 5 | R2D2 | 188257.4 | No | - | - | Code |
| 6 | GDI-I3 | 162100 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 7 | GDI-I3 | 162100 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 8 | Ape-X | 63305.2 | No | Distributed Prioritized Experience Replay | 2018-03-02 | Code |
| 9 | IQN | 42776 | No | Implicit Quantile Networks for Distributional Re... | 2018-06-14 | Code |
| 10 | Prior+Duel hs | 37412.2 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 11 | QR-DQN-1 | 34821 | No | Distributional Reinforcement Learning with Quant... | 2017-10-27 | Code |
| 12 | IMPALA (deep) | 32463.47 | No | IMPALA: Scalable Distributed Deep-RL with Import... | 2018-02-05 | Code |
| 13 | Prior hs | 31181.3 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 14 | Prior+Duel noop | 30276.5 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 15 | ASL DDQN | 26841.6 | No | Train a Real-world Local Path Planner in One Hou... | 2023-05-07 | Code |
| 16 | A3C LSTM hs | 24622.2 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 17 | Bootstrapped DQN | 23429.8 | No | Deep Exploration via Bootstrapped DQN | 2016-02-15 | Code |
| 18 | Prior noop | 23384.2 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 19 | NoisyNet-Dueling | 23134 | No | Noisy Networks for Exploration | 2017-06-30 | Code |
| 20 | A3C FF hs | 22707.9 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 21 | DNA | 20393 | No | DNA: Proximal Policy Optimization with a Dual Ne... | 2022-06-20 | Code |
| 22 | DreamerV2 | 18646 | No | Mastering Atari with Discrete World Models | 2020-10-05 | Code |
| 23 | DDQN (tuned) hs | 17417.2 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 24 | DDRL A3C | 14900 | No | Distributed Deep Reinforcement Learning: Learn h... | 2018-01-09 | Code |
| 25 | Duel hs | 14591.3 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 26 | C51 noop | 14074 | No | A Distributional Perspective on Reinforcement Le... | 2017-07-21 | Code |
| 27 | DDQN (tuned) noop | 13772.8 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 28 | A3C FF (1 day) hs | 13235.9 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 29 | Persistent AL | 13145.34 | No | Increasing the Action Gap: New Operators for Rei... | 2015-12-15 | Code |
| 30 | Duel noop | 12164 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 31 | Reactor 500M | 11033.4 | No | The Reactor: A fast and sample-efficient Actor-C... | 2017-04-15 | - |
| 32 | Advantage Learning | 10054.58 | No | Increasing the Action Gap: New Operators for Rei... | 2015-12-15 | Code |
| 33 | DQN hs | 9743.2 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 34 | DQN noop | 8627.5 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 35 | DDQN+Pop-Art noop | 8299.4 | No | Learning values across many orders of magnitude | 2016-02-24 | - |
| 36 | Nature DQN | 6846 | No | - | - | Code |
| 37 | UCT | 6624.6 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 38 | MAC | 6072 | No | Mean Actor Critic | 2017-09-01 | Code |
| 39 | RIMs-PPO | 5320 | No | Recurrent Independent Mechanisms | 2019-09-24 | Code |
| 40 | DQN Best | 5184 | No | Playing Atari with Deep Reinforcement Learning | 2013-12-19 | Code |
| 41 | POP3D | 4549 | No | Policy Optimization With Penalized Point Probabi... | 2018-07-02 | Code |
| 42 | Gorila | 3822.1 | No | Massively Parallel Methods for Deep Reinforcemen... | 2015-07-15 | Code |
| 43 | A2C + SIL | 2366.2 | No | Self-Imitation Learning | 2018-06-14 | Code |
| 44 | SARSA | 1743 | No | - | - | - |
| 45 | CGP | 1341.6 | No | Evolving simple programs for playing Atari games | 2018-06-14 | Code |
| 46 | Best Learner | 929.4 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 47 | ES FF (1 hour) noop | 744 | No | Evolution Strategies as a Scalable Alternative t... | 2017-03-10 | Code |
| 48 | SAC | 432.1 | No | Soft Actor-Critic for Discrete Action Settings | 2019-10-16 | Code |