Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Atari Games on Atari 2600 Enduro

Metric: Score (higher is better)

Results

| # | Model | Score | Extra Data | Paper | Date | Code |
|---|-------|-------|------------|-------|------|------|
| 1 | GDI-I3 | 14330 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 2 | GDI-I3 | 14330 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 3 | GDI-H3 | 14300 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 4 | C51 noop | 3454 | No | A Distributional Perspective on Reinforcement Le... | 2017-07-21 | Code |
| 5 | MuZero | 2382.44 | No | Mastering Atari, Go, Chess and Shogi by Planning... | 2019-11-19 | Code |
| 6 | R2D2 | 2372.7 | No | - | - | Code |
| 7 | Agent57 | 2367.71 | No | Agent57: Outperforming the Atari Human Benchmark | 2020-03-30 | Code |
| 8 | MuZero (Res2 Adam) | 2365.81 | No | Online and Offline Reinforcement Learning by Pla... | 2021-04-13 | Code |
| 9 | IQN | 2359 | No | Implicit Quantile Networks for Distributional Re... | 2018-06-14 | Code |
| 10 | QR-DQN-1 | 2355 | No | Distributional Reinforcement Learning with Quant... | 2017-10-27 | Code |
| 11 | Prior+Duel noop | 2306.4 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 12 | Duel noop | 2258.2 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 13 | Reactor 500M | 2224.2 | No | The Reactor: A fast and sample-efficient Actor-C... | 2017-04-15 | - |
| 14 | Prior+Duel hs | 2223.9 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 15 | Ape-X | 2177.4 | No | Distributed Prioritized Experience Replay | 2018-03-02 | Code |
| 16 | ASL DDQN | 2103.1 | No | Train a Real-world Local Path Planner in One Hou... | 2023-05-07 | Code |
| 17 | Prior noop | 2093 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 18 | Duel hs | 2077.4 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 19 | DNA | 2059 | No | DNA: Proximal Policy Optimization with a Dual Ne... | 2022-06-20 | Code |
| 20 | NoisyNet-Dueling | 2013 | No | Noisy Networks for Exploration | 2017-06-30 | Code |
| 21 | DDQN+Pop-Art noop | 2002.1 | No | Learning values across many orders of magnitude | 2016-02-24 | - |
| 22 | Prior hs | 1831 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 23 | DreamerV2 | 1656 | No | Mastering Atari with Discrete World Models | 2020-10-05 | Code |
| 24 | Bootstrapped DQN | 1591 | No | Deep Exploration via Bootstrapped DQN | 2016-02-15 | Code |
| 25 | Persistent AL | 1343.1 | No | Increasing the Action Gap: New Operators for Rei... | 2015-12-15 | Code |
| 26 | Advantage Learning | 1252.7 | No | Increasing the Action Gap: New Operators for Rei... | 2015-12-15 | Code |
| 27 | DDQN (tuned) hs | 1216.6 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 28 | DDQN (tuned) noop | 1211.8 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 29 | A2C + SIL | 1205.1 | No | Self-Imitation Learning | 2018-06-14 | Code |
| 30 | Rational DQN Average | 1043 | No | Adaptive Rational Activations to Boost Deep Rein... | 2021-02-18 | Code |
| 31 | Recurrent Rational DQN Average | 957 | No | Adaptive Rational Activations to Boost Deep Rein... | 2021-02-18 | Code |
| 32 | DQN noop | 729 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 33 | DQN Best | 661 | No | Playing Atari with Deep Reinforcement Learning | 2013-12-19 | Code |
| 34 | DQN hs | 626.7 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 35 | POP3D | 459.85 | No | Policy Optimization With Penalized Point Probabi... | 2018-07-02 | Code |
| 36 | VPN | 382 | No | Value Prediction Network | 2017-07-11 | Code |
| 37 | Nature DQN | 301.8 | No | - | - | Code |
| 38 | UCT | 286.3 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 39 | SARSA | 159.4 | No | - | - | - |
| 40 | Best Learner | 129.1 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 41 | ES FF (1 hour) noop | 95 | No | Evolution Strategies as a Scalable Alternative t... | 2017-03-10 | Code |
| 42 | Gorila | 71 | No | Massively Parallel Methods for Deep Reinforcemen... | 2015-07-15 | Code |
| 43 | CGP | 56.8 | No | Evolving simple programs for playing Atari games | 2018-06-14 | Code |
| 44 | SAC | 0.8 | No | Soft Actor-Critic for Discrete Action Settings | 2019-10-16 | Code |
| 45 | IMPALA (deep) | 0 | No | - | - | Code |
| 46 | A3C FF (1 day) hs | -82.2 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 47 | A3C FF hs | -82.5 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 48 | A3C LSTM hs | -82.5 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |