Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Video Games on Atari 2600 HERO

Metric: Score (higher is better)


Results

| # | Model | Score | Extra Data | Paper | Date | Code |
|---|-------|-------|------------|-------|------|------|
| 1 | Agent57 | 114736.26 | No | Agent57: Outperforming the Atari Human Benchmark | 2020-03-30 | Code |
| 2 | MuZero | 49244.11 | No | Mastering Atari, Go, Chess and Shogi by Planning... | 2019-11-19 | Code |
| 3 | R2D2 | 39537.1 | No | - | - | Code |
| 4 | C51 noop | 38874 | No | A Distributional Perspective on Reinforcement Le... | 2017-07-21 | Code |
| 5 | GDI-I3 | 38330 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 6 | GDI-I3 | 38330 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 7 | GDI-H3 | 38225 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 8 | MuZero (Res2 Adam) | 37234.31 | No | Online and Offline Reinforcement Learning by Pla... | 2021-04-13 | Code |
| 9 | IMPALA (deep) | 33730.55 | No | IMPALA: Scalable Distributed Deep-RL with Import... | 2018-02-05 | Code |
| 10 | A2C + SIL | 33156.7 | No | Self-Imitation Learning | 2018-06-14 | Code |
| 11 | A3C FF hs | 32464.1 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 12 | Ape-X | 31655.9 | No | Distributed Prioritized Experience Replay | 2018-03-02 | Code |
| 13 | NoisyNet-Dueling | 31533 | No | Noisy Networks for Exploration | 2017-06-30 | Code |
| 14 | FQF | 30926.2 | No | Fully Parameterized Quantile Function for Distri... | 2019-11-05 | Code |
| 15 | A3C LSTM hs | 28889.5 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 16 | A3C FF (1 day) hs | 28765.8 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 17 | IQN | 28386 | No | Implicit Quantile Networks for Distributional Re... | 2018-06-14 | Code |
| 18 | ASL DDQN | 26578.5 | No | Train a Real-world Local Path Planner in One Hou... | 2023-05-07 | Code |
| 19 | DNA | 24904 | No | DNA: Proximal Policy Optimization with a Dual Ne... | 2022-06-20 | Code |
| 20 | Advantage Learning | 24788.86 | No | Increasing the Action Gap: New Operators for Rei... | 2015-12-15 | Code |
| 21 | Persistent AL | 24175.79 | No | Increasing the Action Gap: New Operators for Rei... | 2015-12-15 | Code |
| 22 | Prior noop | 23037.7 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 23 | DreamerV2 | 21868 | No | Mastering Atari with Discrete World Models | 2020-10-05 | Code |
| 24 | QR-DQN-1 | 21395 | No | Distributional Reinforcement Learning with Quant... | 2017-10-27 | Code |
| 25 | Prior+Duel noop | 21036.5 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 26 | Bootstrapped DQN | 21021.3 | No | Deep Exploration via Bootstrapped DQN | 2016-02-15 | Code |
| 27 | Prior hs | 20889.9 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 28 | Duel noop | 20818.2 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 29 | DQN noop | 20437.8 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 30 | DDQN (tuned) noop | 20130.2 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 31 | Nature DQN | 19950 | No | - | - | Code |
| 32 | Prior+Duel hs | 15459.2 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 33 | Duel hs | 15207.9 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 34 | DQN hs | 14992.9 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 35 | DDQN (tuned) hs | 14892.5 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 36 | DDQN+Pop-Art noop | 14225.2 | No | Learning values across many orders of magnitude | 2016-02-24 | - |
| 37 | UCT | 12859.5 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 38 | MFEC | 11732 | No | Model-Free Episodic Control with State Aggregation | 2020-08-21 | - |
| 39 | Gorila | 8963.4 | No | Massively Parallel Methods for Deep Reinforcemen... | 2015-07-15 | Code |
| 40 | SARSA | 7295 | No | - | - | - |
| 41 | Best linear | 6459 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 42 | Best Learner | 6458.8 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 43 | CURL | 6235.1 | No | CURL: Contrastive Unsupervised Representations f... | 2020-04-08 | Code |
| 44 | CGP | 2974 | No | Evolving simple programs for playing Atari games | 2018-06-14 | Code |