TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Playing Games/Atari Games/Atari 2600 Bank Heist

Atari Games on Atari 2600 Bank Heist

Metric: Score (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Score▼Extra DataPaperDate↕Code
1MuZero (Res2 Adam)27219.8NoOnline and Offline Reinforcement Learning by Pla...2021-04-13Code
2R2D224235.9No--Code
3Agent5723071.5NoAgent57: Outperforming the Atari Human Benchmark2020-03-30Code
4Ape-X1716.4NoDistributed Prioritized Experience Replay2018-03-02Code
5Duel noop1611.9NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
6Prior+Duel noop1503.1NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
7IQN1416NoImplicit Quantile Networks for Distributional Re...2018-06-14Code
8GDI-I31401NoGeneralized Data Distribution Iteration2022-06-07-
9GDI-H31380NoGeneralized Data Distribution Iteration2022-06-07-
10ASL DDQN1340.9NoTrain a Real-world Local Path Planner in One Hou...2023-05-07Code
11NoisyNet-Dueling1318NoNoisy Networks for Exploration2017-06-30Code
12DNA1286NoDNA: Proximal Policy Optimization with a Dual Ne...2022-06-20Code
13MuZero1278.98NoMastering Atari, Go, Chess and Shogi by Planning...2019-11-19Code
14Reactor 500M1259.7NoThe Reactor: A fast and sample-efficient Actor-C...2017-04-15-
15QR-DQN-11249NoDistributional Reinforcement Learning with Quant...2017-10-27Code
16IMPALA (deep)1223.15NoIMPALA: Scalable Distributed Deep-RL with Import...2018-02-05Code
17POP3D1212.23NoPolicy Optimization With Penalized Point Probabi...2018-07-02Code
18Bootstrapped DQN1208NoDeep Exploration via Bootstrapped DQN2016-02-15Code
19A2C + SIL1137.8NoSelf-Imitation Learning2018-06-14Code
20Duel hs1129.3NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
21DreamerV21126NoMastering Atari with Discrete World Models2020-10-05Code
22DDQN+Pop-Art noop1103.3NoLearning values across many orders of magnitude2016-02-24-
23Prior noop1054.6NoPrioritized Experience Replay2015-11-18Code
24DDQN (tuned) noop1030.6NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
25Prior+Duel hs1004.6NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
26C51 noop976NoA Distributional Perspective on Reinforcement Le...2017-07-21Code
27A3C FF hs970.1NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
28A3C FF (1 day) hs946NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
29A3C LSTM hs932.8NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
30DDQN (tuned) hs886NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
31Prior hs876.6NoPrioritized Experience Replay2015-11-18Code
32Persistent AL874.99NoIncreasing the Action Gap: New Operators for Rei...2015-12-15Code
33Advantage Learning633.63NoIncreasing the Action Gap: New Operators for Rei...2015-12-15Code
34UCT497.8NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
35DQN noop455NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
36Nature DQN429.7No--Code
37Gorila399.4NoMassively Parallel Methods for Deep Reinforcemen...2015-07-15Code
38DQN hs312.7NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
39Rainbow+SEER276.6NoImproving Computational Efficiency in Visual Rei...2021-03-04Code
40ES FF (1 hour) noop225NoEvolution Strategies as a Scalable Alternative t...2017-03-10Code
41CURL193.7NoCURL: Contrastive Unsupervised Representations f...2020-04-08Code
42Best Learner190.8NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
43CGP148NoEvolving simple programs for playing Atari games2018-06-14Code
44Discrete Latent Space World Model (VQ-VAE)121.6NoSmaller World Models for Reinforcement Learning2020-10-12-
45SARSA67.4No---