Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Atari Games on Atari 2600 Pong

Metric: Score (higher is better)

Results

| # | Model | Score | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | Duel noop | 21 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 2 | ES FF (1 hour) noop | 21 | No | Evolution Strategies as a Scalable Alternative t... | 2017-03-10 | Code |
| 3 | IQN | 21 | No | Implicit Quantile Networks for Distributional Re... | 2018-06-14 | Code |
| 4 | MuZero | 21 | No | Mastering Atari, Go, Chess and Shogi by Planning... | 2019-11-19 | Code |
| 5 | R2D2 | 21 | No | - | - | Code |
| 6 | NoisyNet-Dueling | 21 | No | Noisy Networks for Exploration | 2017-06-30 | Code |
| 7 | DQN Best | 21 | No | Playing Atari with Deep Reinforcement Learning | 2013-12-19 | Code |
| 8 | QR-DQN-1 | 21 | No | Distributional Reinforcement Learning with Quant... | 2017-10-27 | Code |
| 9 | UCT | 21 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 10 | GDI-H3(200M frames) | 21 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 11 | GDI-I3(200M frames) | 21 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 12 | GDI-I3 | 21 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 13 | GDI-H3 | 21 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 14 | ASL DDQN | 21 | No | Train a Real-world Local Path Planner in One Hou... | 2023-05-07 | Code |
| 15 | IMPALA (deep) | 20.98 | No | IMPALA: Scalable Distributed Deep-RL with Import... | 2018-02-05 | Code |
| 16 | MuZero (Res2 Adam) | 20.95 | No | Online and Offline Reinforcement Learning by Pla... | 2021-04-13 | Code |
| 17 | DDQN (tuned) noop | 20.9 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 18 | Prior+Duel noop | 20.9 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 19 | C51 noop | 20.9 | No | A Distributional Perspective on Reinforcement Le... | 2017-07-21 | Code |
| 20 | Bootstrapped DQN | 20.9 | No | Deep Exploration via Bootstrapped DQN | 2016-02-15 | Code |
| 21 | Ape-X | 20.9 | No | Distributed Prioritized Experience Replay | 2018-03-02 | Code |
| 22 | A2C + SIL | 20.9 | No | Self-Imitation Learning | 2018-06-14 | Code |
| 23 | Agent57 | 20.67 | No | Agent57: Outperforming the Atari Human Benchmark | 2020-03-30 | Code |
| 24 | Prior noop | 20.6 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 25 | DDQN+Pop-Art noop | 20.6 | No | Learning values across many orders of magnitude | 2016-02-24 | - |
| 26 | POP3D | 20.5 | No | Policy Optimization With Penalized Point Probabi... | 2018-07-02 | Code |
| 27 | Discrete Latent Space World Model (VQ-VAE) | 20.2 | No | Smaller World Models for Reinforcement Learning | 2020-10-12 | - |
| 28 | DDRL A3C | 20 | No | Distributed Deep Reinforcement Learning: Learn h... | 2018-01-09 | Code |
| 29 | CGP | 20 | No | Evolving simple programs for playing Atari games | 2018-06-14 | Code |
| 30 | DreamerV2 | 20 | No | Mastering Atari with Discrete World Models | 2020-10-05 | Code |
| 31 | Persistent AL | 19.76 | No | Increasing the Action Gap: New Operators for Rei... | 2015-12-15 | Code |
| 32 | DNA | 19.7 | No | DNA: Proximal Policy Optimization with a Dual Ne... | 2022-06-20 | Code |
| 33 | Advantage Learning | 19.66 | No | Increasing the Action Gap: New Operators for Rei... | 2015-12-15 | Code |
| 34 | DQN noop | 19.5 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 35 | DDQN (tuned) hs | 19.1 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 36 | Nature DQN | 18.9 | No | - | - | Code |
| 37 | Prior hs | 18.9 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 38 | Duel hs | 18.8 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 39 | Prior+Duel hs | 18.4 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 40 | Recurrent Rational DQN Average | 18.13 | No | Adaptive Rational Activations to Boost Deep Rein... | 2021-02-18 | Code |
| 41 | Rational DQN Average | 18.04 | No | Adaptive Rational Activations to Boost Deep Rein... | 2021-02-18 | Code |
| 42 | DQN hs | 18 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 43 | DT | 17.1 | Yes | Decision Transformer: Reinforcement Learning via... | 2021-06-02 | Code |
| 44 | Gorila | 16.7 | No | Massively Parallel Methods for Deep Reinforcemen... | 2015-07-15 | Code |
| 45 | A3C FF (1 day) hs | 11.4 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 46 | A3C LSTM hs | 10.7 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 47 | MAC | 10.6 | No | Mean Actor Critic | 2017-09-01 | Code |
| 48 | A3C FF hs | 5.6 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 49 | CURL | 2.1 | No | CURL: Contrastive Unsupervised Representations f... | 2020-04-08 | Code |
| 50 | SARSA | -17.4 | No | - | - | - |
| 51 | Best Learner | -19 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 52 | SAC | -20.98 | No | Soft Actor-Critic for Discrete Action Settings | 2019-10-16 | Code |