Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Atari Games on Atari 2600 Road Runner

Metric: Score (higher is better)

Results

| # | Model | Score | Extra Data | Paper | Date | Code |
|---|-------|-------|------------|-------|------|------|
| 1 | GDI-H3 | 999999 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 2 | GDI-I3 | 878600 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 3 | GDI-I3 | 878600 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 4 | MuZero | 613411.8 | No | Mastering Atari, Go, Chess and Shogi by Planning... | 2019-11-19 | Code |
| 5 | R2D2 | 599246.7 | No | - | - | Code |
| 6 | MuZero (Res2 Adam) | 531097 | No | Online and Offline Reinforcement Learning by Pla... | 2021-04-13 | Code |
| 7 | Agent57 | 243025.8 | No | Agent57: Outperforming the Atari Human Benchmark | 2020-03-30 | Code |
| 8 | NoisyNet-Dueling | 234352 | No | Noisy Networks for Exploration | 2017-06-30 | Code |
| 9 | Ape-X | 222234.5 | No | Distributed Prioritized Experience Replay | 2018-03-02 | Code |
| 10 | DreamerV2 | 203576 | No | Mastering Atari with Discrete World Models | 2020-10-05 | Code |
| 11 | A3C LSTM hs | 73949 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 12 | Duel noop | 69524 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 13 | QR-DQN-1 | 64262 | No | Distributional Reinforcement Learning with Quant... | 2017-10-27 | Code |
| 14 | Prior+Duel noop | 62151 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 15 | DNA | 61713 | No | DNA: Proximal Policy Optimization with a Dual Ne... | 2022-06-20 | Code |
| 16 | Duel hs | 58549 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 17 | IQN | 57900 | No | Implicit Quantile Networks for Distributional Re... | 2018-06-14 | Code |
| 18 | Prior noop | 57608 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 19 | IMPALA (deep) | 57121 | No | IMPALA: Scalable Distributed Deep-RL with Import... | 2018-02-05 | Code |
| 20 | A2C + SIL | 57071.7 | No | Self-Imitation Learning | 2018-06-14 | Code |
| 21 | ASL DDQN | 56520 | No | Train a Real-world Local Path Planner in One Hou... | 2023-05-07 | Code |
| 22 | C51 noop | 55839 | No | A Distributional Perspective on Reinforcement Le... | 2017-07-21 | Code |
| 23 | Prior+Duel hs | 54630 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 24 | Advantage Learning | 52351.23 | No | Increasing the Action Gap: New Operators for Rei... | 2015-12-15 | Code |
| 25 | Prior hs | 52264 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 26 | Bootstrapped DQN | 51500 | No | Deep Exploration via Bootstrapped DQN | 2016-02-15 | Code |
| 27 | DDQN+Pop-Art noop | 47770 | No | Learning values across many orders of magnitude | 2016-02-24 | - |
| 28 | POP3D | 44679.67 | No | Policy Optimization With Penalized Point Probabi... | 2018-07-02 | Code |
| 29 | DDQN (tuned) noop | 44127 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 30 | DDQN (tuned) hs | 43156 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 31 | Gorila | 43079.8 | No | Massively Parallel Methods for Deep Reinforcemen... | 2015-07-15 | Code |
| 32 | DQN noop | 39544 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 33 | UCT | 38725 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 34 | DQN hs | 35215 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 35 | A3C FF hs | 34216 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 36 | A3C FF (1 day) hs | 31769 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 37 | Nature DQN | 18257 | No | - | - | Code |
| 38 | ES FF (1 hour) noop | 16590 | No | Evolution Strategies as a Scalable Alternative t... | 2017-03-10 | Code |
| 39 | Rainbow+SEER | 11794 | No | Improving Computational Efficiency in Visual Rei... | 2021-03-04 | Code |
| 40 | CGP | 8960 | No | Evolving simple programs for playing Atari games | 2018-06-14 | Code |
| 41 | CURL | 6786.7 | No | CURL: Contrastive Unsupervised Representations f... | 2020-04-08 | Code |
| 42 | SAC | 305.3 | No | Soft Actor-Critic for Discrete Action Settings | 2019-10-16 | Code |
| 43 | SARSA | 89.1 | No | - | - | - |
| 44 | Best Learner | 67.7 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |