Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Atari Games on Atari 2600 Time Pilot

Metric: Score (higher is better)

Results

| # | Model | Score | Extra Data | Paper | Date | Code |
|---|-------|-------|------------|-------|------|------|
| 1 | MuZero | 476763.9 | No | Mastering Atari, Go, Chess and Shogi by Planning... | 2019-11-19 | Code |
| 2 | GDI-H3 | 450810 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 3 | R2D2 | 445377.3 | No | - | - | Code |
| 4 | MuZero (Res2 Adam) | 424011.16 | No | Online and Offline Reinforcement Learning by Pla... | 2021-04-13 | Code |
| 5 | Agent57 | 405425.31 | No | Agent57: Outperforming the Atari Human Benchmark | 2020-03-30 | Code |
| 6 | GDI-I3 | 216770 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 7 | GDI-I3 | 216770 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 8 | Ape-X | 87085 | No | Distributed Prioritized Experience Replay | 2018-03-02 | Code |
| 9 | UCT | 63854.5 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 10 | IMPALA (deep) | 48481.5 | No | IMPALA: Scalable Distributed Deep-RL with Import... | 2018-02-05 | Code |
| 11 | DreamerV2 | 37945 | No | Mastering Atari with Discrete World Models | 2020-10-05 | Code |
| 12 | A3C LSTM hs | 27202 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 13 | Rational DQN Average | 17632 | No | Adaptive Rational Activations to Boost Deep Rein... | 2021-02-18 | Code |
| 14 | NoisyNet-Dueling | 17301 | No | Noisy Networks for Exploration | 2017-06-30 | Code |
| 15 | Recurrent Rational DQN Average | 13261 | No | Adaptive Rational Activations to Boost Deep Rein... | 2021-02-18 | Code |
| 16 | DNA | 12774 | No | DNA: Proximal Policy Optimization with a Dual Ne... | 2022-06-20 | Code |
| 17 | A3C FF hs | 12679 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 18 | IQN | 12236 | No | Implicit Quantile Networks for Distributional Re... | 2018-06-14 | Code |
| 19 | ASL DDQN | 12071 | No | Train a Real-world Local Path Planner in One Hou... | 2023-05-07 | Code |
| 20 | CGP | 12040 | No | Evolving simple programs for playing Atari games | 2018-06-14 | Code |
| 21 | Duel noop | 11666 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 22 | A2C + SIL | 10811.7 | No | Self-Imitation Learning | 2018-06-14 | Code |
| 23 | QR-DQN-1 | 10345 | No | Distributional Reinforcement Learning with Quant... | 2017-10-27 | Code |
| 24 | Prior noop | 9197 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 25 | Bootstrapped DQN | 9079.4 | No | Deep Exploration via Bootstrapped DQN | 2016-02-15 | Code |
| 26 | Advantage Learning | 8969.12 | No | Increasing the Action Gap: New Operators for Rei... | 2015-12-15 | Code |
| 27 | DDQN (tuned) noop | 8339 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 28 | C51 noop | 8329 | No | A Distributional Perspective on Reinforcement Le... | 2017-07-21 | Code |
| 29 | Gorila | 8267.8 | No | Massively Parallel Methods for Deep Reinforcemen... | 2015-07-15 | Code |
| 30 | Prior+Duel noop | 7553 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 31 | DDQN (tuned) hs | 6608 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 32 | Duel hs | 6601 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 33 | Prior hs | 5963 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 34 | Nature DQN | 5947 | No | - | - | Code |
| 35 | A3C FF (1 day) hs | 5825 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 36 | ES FF (1 hour) noop | 4970 | No | Evolution Strategies as a Scalable Alternative t... | 2017-03-10 | Code |
| 37 | Prior+Duel hs | 4871 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 38 | DQN noop | 4870 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 39 | DDQN+Pop-Art noop | 4870 | No | Learning values across many orders of magnitude | 2016-02-24 | - |
| 40 | DQN hs | 4786 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 41 | IDVQ + DRSC + XNES | 4600 | No | Playing Atari with Six Neurons | 2018-06-04 | Code |
| 42 | POP3D | 3770.33 | No | Policy Optimization With Penalized Point Probabi... | 2018-07-02 | Code |
| 43 | Best Learner | 3741.2 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 44 | SARSA | 24.9 | No | - | - | - |