TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Playing Games/Atari Games/Atari 2600 Q*Bert

Atari Games on Atari 2600 Q*Bert

Metric: Score (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Score▼Extra DataPaperDate↕Code
1Agent57580328.14NoAgent57: Outperforming the Atari Human Benchmark2020-03-30Code
2QR-DQN-1572510NoDistributional Reinforcement Learning with Quant...2017-10-27Code
3R2D2408850No--Code
4IMPALA (deep)351200.12NoIMPALA: Scalable Distributed Deep-RL with Import...2018-02-05Code
5Ape-X302391.3NoDistributed Prioritized Experience Replay2018-03-02Code
6A2C + SIL104975.6NoSelf-Imitation Learning2018-06-14Code
7MuZero (Res2 Adam)94906.25NoOnline and Offline Reinforcement Learning by Pla...2021-04-13Code
8DreamerV294688NoMastering Atari with Discrete World Models2020-10-05Code
9MuZero72276NoMastering Atari, Go, Chess and Shogi by Planning...2019-11-19Code
10DNA52398NoDNA: Proximal Policy Optimization with a Dual Ne...2022-06-20Code
11GDI-H3(200M frames)28657NoGeneralized Data Distribution Iteration2022-06-07-
12GDI-H328657NoGeneralized Data Distribution Iteration2022-06-07-
13GDI-I327800NoGDI: Rethinking What Makes Reinforcement Learnin...2021-06-11-
14GDI-I327800NoGDI: Rethinking What Makes Reinforcement Learnin...2021-06-11-
15NoisyNet-Dueling27121NoNoisy Networks for Exploration2017-06-30Code
16IQN25750NoImplicit Quantile Networks for Distributional Re...2018-06-14Code
17ASL DDQN24548.8NoTrain a Real-world Local Path Planner in One Hou...2023-05-07Code
18C51 noop23784NoA Distributional Perspective on Reinforcement Le...2017-07-21Code
19A3C LSTM hs21307.5NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
20Duel noop19220.3NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
21Prior+Duel noop18760.3NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
22UCT17343.4NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
23Prior noop16256.5NoPrioritized Experience Replay2015-11-18Code
24MP-EB15805NoIncentivizing Exploration In Reinforcement Learn...2015-07-03Code
25POP3D15396.67NoPolicy Optimization With Penalized Point Probabi...2018-07-02Code
26A3C FF hs15148.8NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
27Bootstrapped DQN15092.7NoDeep Exploration via Bootstrapped DQN2016-02-15Code
28DDQN (tuned) noop15088.5NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
29VPN14517NoValue Prediction Network2017-07-11Code
30Rational DQN Average14436NoAdaptive Rational Activations to Boost Deep Rein...2021-02-18Code
31Advantage Learning14368.03NoIncreasing the Action Gap: New Operators for Rei...2015-12-15Code
32Duel hs14175.8NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
33MFEC14135NoModel-Free Episodic Control with State Aggregation2020-08-21-
34Recurrent Rational DQN Average14080NoAdaptive Rational Activations to Boost Deep Rein...2021-02-18Code
35Prior+Duel hs14063NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
36A3C FF (1 day) hs13752.3NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
37DQN noop13117.3NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
38DDQN (tuned) hs11020.8NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
39Nature DQN10596No--Code
40Prior hs9944NoPrioritized Experience Replay2015-11-18Code
41DQN hs9271.5NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
42Gorila7089.8NoMassively Parallel Methods for Deep Reinforcemen...2015-07-15Code
43DDQN+Pop-Art noop5236.8NoLearning values across many orders of magnitude2016-02-24-
44DQN Best4500NoPlaying Atari with Deep Reinforcement Learning2013-12-19Code
45Qbert Rainbow+SEER4123.5NoImproving Computational Efficiency in Visual Rei...2021-03-04Code
46Sarsa-φ-EB4111.8NoCount-Based Exploration in Feature Space for Rei...2017-06-25Code
47Sarsa-ε3895.3NoCount-Based Exploration in Feature Space for Rei...2017-06-25Code
48IDVQ + DRSC + XNES1250NoPlaying Atari with Six Neurons2018-06-04Code
49CURL1225.6NoCURL: Contrastive Unsupervised Representations f...2020-04-08Code
50SARSA960.3No---
51CGP770NoEvolving simple programs for playing Atari games2018-06-14Code
52Best Learner613.5NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
53SAC280.5NoSoft Actor-Critic for Discrete Action Settings2019-10-16Code
54MAC243.4NoMean Actor Critic2017-09-01Code
55ES FF (1 hour) noop147.5NoEvolution Strategies as a Scalable Alternative t...2017-03-10Code
56DT25.1NoDecision Transformer: Reinforcement Learning via...2021-06-02Code