Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Video Games on Atari 2600 Up and Down

Metric: Score (higher is better)

Results

| # | Model | Score | Extra Data | Paper | Date | Code |
|---|-------|-------|------------|-------|------|------|
| 1 | GDI-I3 | 986440 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 2 | GDI-I3 | 986440 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 3 | GDI-H3 | 966590 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 4 | MuZero | 715545.61 | No | Mastering Atari, Go, Chess and Shogi by Planning... | 2019-11-19 | Code |
| 5 | DreamerV2 | 653662 | No | Mastering Atari with Discrete World Models | 2020-10-05 | Code |
| 6 | MuZero (Res2 Adam) | 634898.18 | No | Online and Offline Reinforcement Learning by Pla... | 2021-04-13 | Code |
| 7 | Agent57 | 623805.73 | No | Agent57: Outperforming the Atari Human Benchmark | 2020-03-30 | Code |
| 8 | R2D2 | 589226.9 | No | - | - | Code |
| 9 | Ape-X | 401884.3 | No | Distributed Prioritized Experience Replay | 2018-03-02 | Code |
| 10 | RIMs-PPO | 390000 | No | Recurrent Independent Mechanisms | 2019-09-24 | Code |
| 11 | IMPALA (deep) | 332546.75 | No | IMPALA: Scalable Distributed Deep-RL with Import... | 2018-02-05 | Code |
| 12 | DNA | 291934 | No | DNA: Proximal Policy Optimization with a Dual Ne... | 2022-06-20 | Code |
| 13 | POP3D | 242701.51 | No | Policy Optimization With Penalized Point Probabi... | 2018-07-02 | Code |
| 14 | A3C LSTM hs | 105728.7 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 15 | IQN | 88148 | No | Implicit Quantile Networks for Distributional Re... | 2018-06-14 | Code |
| 16 | A3C FF hs | 74705.7 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 17 | UCT | 74473.6 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 18 | QR-DQN-1 | 71260 | No | Distributional Reinforcement Learning with Quant... | 2017-10-27 | Code |
| 19 | ES FF (1 hour) noop | 67974 | No | Evolution Strategies as a Scalable Alternative t... | 2017-03-10 | Code |
| 20 | NoisyNet-Dueling | 61326 | No | Noisy Networks for Exploration | 2017-06-30 | Code |
| 21 | A3C FF (1 day) hs | 54525.4 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 22 | A2C + SIL | 53314.6 | No | Self-Imitation Learning | 2018-06-14 | Code |
| 23 | Duel noop | 44939.6 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 24 | Prior+Duel noop | 33879.1 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 25 | Bootstrapped DQN | 26231 | No | Deep Exploration via Bootstrapped DQN | 2016-02-15 | Code |
| 26 | ASL DDQN | 25127.4 | No | Train a Real-world Local Path Planner in One Hou... | 2023-05-07 | Code |
| 27 | Duel hs | 24759.2 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 28 | DDQN (tuned) noop | 22972.2 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 29 | Prior+Duel hs | 22681.3 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 30 | DDQN+Pop-Art noop | 22474.4 | No | Learning values across many orders of magnitude | 2016-02-24 | - |
| 31 | DDQN (tuned) hs | 19086.9 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 32 | Prior noop | 16154.1 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 33 | C51 noop | 15612 | No | A Distributional Perspective on Reinforcement Le... | 2017-07-21 | Code |
| 34 | CGP | 14524 | No | Evolving simple programs for playing Atari games | 2018-06-14 | Code |
| 35 | Advantage Learning | 13909.74 | No | Increasing the Action Gap: New Operators for Rei... | 2015-12-15 | Code |
| 36 | Prior hs | 12157.4 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 37 | DQN noop | 9989.9 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 38 | Gorila | 8747.7 | No | Massively Parallel Methods for Deep Reinforcemen... | 2015-07-15 | Code |
| 39 | Nature DQN | 8456 | No | - | - | Code |
| 40 | DQN hs | 8038.5 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 41 | Best Learner | 3532.7 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 42 | CURL | 2735.2 | No | CURL: Contrastive Unsupervised Representations f... | 2020-04-08 | Code |
| 43 | SARSA | 2449 | No | - | - | - |
| 44 | SAC | 250.7 | No | Soft Actor-Critic for Discrete Action Settings | 2019-10-16 | Code |