TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Playing Games/Video Games/Atari 2600 Kangaroo

Video Games on Atari 2600 Kangaroo

Metric: Score (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Score▼Extra DataPaperDate↕Code
1Agent5724034.16NoAgent57: Outperforming the Atari Human Benchmark2020-03-30Code
2MuZero16763.6NoMastering Atari, Go, Chess and Shogi by Planning...2019-11-19Code
3Prior noop16200NoPrioritized Experience Replay2015-11-18Code
4IQN15487NoImplicit Quantile Networks for Distributional Re...2018-06-14Code
5QR-DQN-115356NoDistributional Reinforcement Learning with Quant...2017-10-27Code
6NoisyNet-Dueling15227NoNoisy Networks for Exploration2017-06-30Code
7Bootstrapped DQN14862.5NoDeep Exploration via Bootstrapped DQN2016-02-15Code
8Duel noop14854NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
9GDI-H314636NoGeneralized Data Distribution Iteration2022-06-07-
10GDI-I314500NoGeneralized Data Distribution Iteration2022-06-07-
11GDI-I314500NoGeneralized Data Distribution Iteration2022-06-07-
12DNA14373NoDNA: Proximal Policy Optimization with a Dual Ne...2022-06-20Code
13R2D214130.7No--Code
14DreamerV214064NoMastering Atari with Discrete World Models2020-10-05Code
15MuZero (Res2 Adam)13838NoOnline and Offline Reinforcement Learning by Pla...2021-04-13Code
16DDQN+Pop-Art noop13150NoLearning values across many orders of magnitude2016-02-24-
17ASL DDQN13027NoTrain a Real-world Local Path Planner in One Hou...2023-05-07Code
18DDQN (tuned) noop12992NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
19C51 noop12853NoA Distributional Perspective on Reinforcement Le...2017-07-21Code
20Prior hs12185NoPrioritized Experience Replay2015-11-18Code
21Persistent AL11478.46NoIncreasing the Action Gap: New Operators for Rei...2015-12-15Code
22DDQN (tuned) hs11204NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
23ES FF (1 hour) noop11200NoEvolution Strategies as a Scalable Alternative t...2017-03-10Code
24Advantage Learning10809.16NoIncreasing the Action Gap: New Operators for Rei...2015-12-15Code
25Duel hs10334NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
26DQN noop7259NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
27Nature DQN6740No--Code
28Recurrent Rational DQN Average5266NoAdaptive Rational Activations to Boost Deep Rein...2021-02-18Code
29DQN hs4496NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
30POP3D3891.67NoPolicy Optimization With Penalized Point Probabi...2018-07-02Code
31Rational DQN Average2941NoAdaptive Rational Activations to Boost Deep Rein...2021-02-18Code
32A2C + SIL2888.3NoSelf-Imitation Learning2018-06-14Code
33UCT1990NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
34Prior+Duel noop1792NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
35IMPALA (deep)1632NoIMPALA: Scalable Distributed Deep-RL with Import...2018-02-05Code
36Best Learner1622.1NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
37Gorila1431NoMassively Parallel Methods for Deep Reinforcemen...2015-07-15Code
38Ape-X1416NoDistributed Prioritized Experience Replay2018-03-02Code
39CGP1400NoEvolving simple programs for playing Atari games2018-06-14Code
40IDVQ + DRSC + XNES1200NoPlaying Atari with Six Neurons2018-06-04Code
41Prior+Duel hs861NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
42CURL345.3NoCURL: Contrastive Unsupervised Representations f...2020-04-08Code
43A3C LSTM hs125NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
44A3C FF (1 day) hs106NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
45A3C FF hs94NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
46SAC29.3NoSoft Actor-Critic for Discrete Action Settings2019-10-16Code
47SARSA8.8No---