Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Video Games on Atari 2600 Wizard of Wor

Metric: Score (higher is better)

Results

| # | Model | Score | Extra Data | Paper | Date | Code |
|---|-------|-------|------------|-------|------|------|
| 1 | MuZero | 197126 | No | Mastering Atari, Go, Chess and Shogi by Planning... | 2019-11-19 | Code |
| 2 | Agent57 | 157306.41 | No | Agent57: Outperforming the Atari Human Benchmark | 2020-03-30 | Code |
| 3 | R2D2 | 144362.7 | No | - | - | Code |
| 4 | UCT | 105500 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 5 | MuZero (Res2 Adam) | 100096.6 | No | Online and Offline Reinforcement Learning by Pla... | 2021-04-13 | Code |
| 6 | GDI-I3 | 64239 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 7 | GDI-H3 | 63735 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 8 | Ape-X | 46204 | No | Distributed Prioritized Experience Replay | 2018-03-02 | Code |
| 9 | FQF | 44782.6 | No | Fully Parameterized Quantile Function for Distri... | 2019-11-05 | Code |
| 10 | IQN | 31190 | No | Implicit Quantile Networks for Distributional Re... | 2018-06-14 | Code |
| 11 | QR-DQN-1 | 25061 | No | Distributional Reinforcement Learning with Quant... | 2017-10-27 | Code |
| 12 | ASL DDQN | 21049 | No | Train a Real-world Local Path Planner in One Hou... | 2023-05-07 | Code |
| 13 | DNA | 20851 | No | DNA: Proximal Policy Optimization with a Dual Ne... | 2022-06-20 | Code |
| 14 | A3C LSTM hs | 18082 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 15 | A3C FF hs | 17244 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 16 | DreamerV2 | 12851 | No | Mastering Atari with Discrete World Models | 2020-10-05 | Code |
| 17 | Prior+Duel noop | 12352 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 18 | Prior+Duel hs | 10471 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 19 | Gorila | 10431 | No | Massively Parallel Methods for Deep Reinforcemen... | 2015-07-15 | Code |
| 20 | Advantage Learning | 9541.14 | No | Increasing the Action Gap: New Operators for Rei... | 2015-12-15 | Code |
| 21 | C51 noop | 9300 | No | A Distributional Perspective on Reinforcement Le... | 2017-07-21 | Code |
| 22 | IMPALA (deep) | 9157.5 | No | IMPALA: Scalable Distributed Deep-RL with Import... | 2018-02-05 | Code |
| 23 | NoisyNet-Dueling | 9149 | No | Noisy Networks for Exploration | 2017-06-30 | Code |
| 24 | Duel noop | 7855 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 25 | DDQN (tuned) noop | 7492 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 26 | A2C + SIL | 7088.3 | No | Self-Imitation Learning | 2018-06-14 | Code |
| 27 | Duel hs | 7054 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 28 | Bootstrapped DQN | 6804.7 | No | Deep Exploration via Bootstrapped DQN | 2016-02-15 | Code |
| 29 | DDQN (tuned) hs | 6201 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 30 | Prior hs | 5727 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 31 | A3C FF (1 day) hs | 5278 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 32 | Prior noop | 4802 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 33 | POP3D | 4704 | No | Policy Optimization With Penalized Point Probabi... | 2018-07-02 | Code |
| 34 | CGP | 3820 | No | Evolving simple programs for playing Atari games | 2018-06-14 | Code |
| 35 | ES FF (1 hour) noop | 3480 | No | Evolution Strategies as a Scalable Alternative t... | 2017-03-10 | Code |
| 36 | Nature DQN | 3393 | No | - | - | Code |
| 37 | DQN noop | 2704 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 38 | Best Learner | 1981.3 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 39 | DQN hs | 1609 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 40 | DDQN+Pop-Art noop | 483 | No | Learning values across many orders of magnitude | 2016-02-24 | - |
| 41 | SARSA | 36.9 | No | - | - | - |