Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Video Games on Atari 2600 Assault

Metric: Score (higher is better)

Results

| # | Model | Score | Extra Data | Paper | Date | Code |
|---|-------|-------|------------|-------|------|------|
| 1 | MuZero | 143972.03 | No | Mastering Atari, Go, Chess and Shogi by Planning... | 2019-11-19 | Code |
| 2 | R2D2 | 108197 | No | - | - | Code |
| 3 | GDI-H3 | 97155 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 4 | Agent57 | 67212.67 | No | Agent57: Outperforming the Atari Human Benchmark | 2020-03-30 | Code |
| 5 | GDI-I3 | 63876 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 6 | MuZero (Res2 Adam) | 33292.22 | No | Online and Offline Reinforcement Learning by Pla... | 2021-04-13 | Code |
| 7 | IQN | 29091 | No | Implicit Quantile Networks for Distributional Re... | 2018-06-14 | Code |
| 8 | Ape-X | 24559.4 | No | Distributed Prioritized Experience Replay | 2018-03-02 | Code |
| 9 | DreamerV2 | 23625 | No | Mastering Atari with Discrete World Models | 2020-10-05 | Code |
| 10 | QR-DQN-1 | 22012 | No | Distributional Reinforcement Learning with Quant... | 2017-10-27 | Code |
| 11 | IMPALA (deep) | 19148.47 | No | IMPALA: Scalable Distributed Deep-RL with Import... | 2018-02-05 | Code |
| 12 | DNA | 16293 | No | DNA: Proximal Policy Optimization with a Dual Ne... | 2022-06-20 | Code |
| 13 | A3C LSTM hs | 14497.9 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 14 | ASL DDQN | 14372.8 | No | Train a Real-world Local Path Planner in One Hou... | 2023-05-07 | Code |
| 15 | Prior+Duel noop | 11477 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 16 | NoisyNet-Dueling | 11231 | No | Noisy Networks for Exploration | 2017-06-30 | Code |
| 17 | Prior+Duel hs | 10950.6 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 18 | Prior+Duel hs | 10950.6 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 19 | DDQN+Pop-Art noop | 9011.6 | No | Learning values across many orders of magnitude | 2016-02-24 | - |
| 20 | Reactor 500M | 8323.3 | No | The Reactor: A fast and sample-efficient Actor-C... | 2017-04-15 | - |
| 21 | Bootstrapped DQN | 8047.1 | No | Deep Exploration via Bootstrapped DQN | 2016-02-15 | Code |
| 22 | Prior noop | 7672.1 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 23 | C51 noop | 7203 | No | A Distributional Perspective on Reinforcement Le... | 2017-07-21 | Code |
| 24 | Prior hs | 6548.9 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 25 | DDQN (tuned) hs | 6060.8 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 26 | A3C FF hs | 5474.9 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 27 | POP3D | 5400.13 | No | Policy Optimization With Penalized Point Probabi... | 2018-07-02 | Code |
| 28 | DDQN (tuned) noop | 5393.2 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 29 | Duel noop | 4621 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 30 | DQN noop | 4280.4 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 31 | Duel hs | 3994.8 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 32 | A3C FF (1 day) hs | 3746.1 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 33 | Advantage Learning | 3661.51 | No | Increasing the Action Gap: New Operators for Rei... | 2015-12-15 | Code |
| 34 | DQN hs | 3489.3 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 35 | Nature DQN | 3359 | No | - | - | Code |
| 36 | Persistent AL | 3304.33 | No | Increasing the Action Gap: New Operators for Rei... | 2015-12-15 | Code |
| 37 | A2C + SIL | 1812 | No | Self-Imitation Learning | 2018-06-14 | Code |
| 38 | ES FF (1 hour) noop | 1673.9 | No | Evolution Strategies as a Scalable Alternative t... | 2017-03-10 | Code |
| 39 | UCT | 1512.2 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 40 | Gorila | 1195.8 | No | Massively Parallel Methods for Deep Reinforcemen... | 2015-07-15 | Code |
| 41 | CGP | 890.4 | No | Evolving simple programs for playing Atari games | 2018-06-14 | Code |
| 42 | Best Learner | 628 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 43 | CURL | 543.7 | No | CURL: Contrastive Unsupervised Representations f... | 2020-04-08 | Code |
| 44 | SARSA | 537 | No | - | - | - |
| 45 | SAC | 350 | No | Soft Actor-Critic for Discrete Action Settings | 2019-10-16 | Code |