Atari Games on Atari 2600 Road Runner

Metric: Score (higher is better)

Results

#	Model	Score	Extra Data	Paper	Date	Code
1	GDI-H3	999999	No	Generalized Data Distribution Iteration	2022-06-07	-
2	GDI-I3	878600	No	Generalized Data Distribution Iteration	2022-06-07	-
3	GDI-I3	878600	No	Generalized Data Distribution Iteration	2022-06-07	-
4	MuZero	613411.8	No	Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model	2019-11-19	Code
5	R2D2	599246.7	No	-	-	Code
6	MuZero (Res2 Adam)	531097	No	Online and Offline Reinforcement Learning by Planning with a Learned Model	2021-04-13	Code
7	Agent57	243025.8	No	Agent57: Outperforming the Atari Human Benchmark	2020-03-30	Code
8	NoisyNet-Dueling	234352	No	Noisy Networks for Exploration	2017-06-30	Code
9	Ape-X	222234.5	No	Distributed Prioritized Experience Replay	2018-03-02	Code
10	DreamerV2	203576	No	Mastering Atari with Discrete World Models	2020-10-05	Code
11	A3C LSTM hs	73949	No	Asynchronous Methods for Deep Reinforcement Learning	2016-02-04	Code
12	Duel noop	69524	No	Dueling Network Architectures for Deep Reinforcement Learning	2015-11-20	Code
13	QR-DQN-1	64262	No	Distributional Reinforcement Learning with Quantile Regression	2017-10-27	Code
14	Prior+Duel noop	62151	No	Dueling Network Architectures for Deep Reinforcement Learning	2015-11-20	Code
15	DNA	61713	No	DNA: Proximal Policy Optimization with a Dual Network Architecture	2022-06-20	Code
16	Duel hs	58549	No	Dueling Network Architectures for Deep Reinforcement Learning	2015-11-20	Code
17	IQN	57900	No	Implicit Quantile Networks for Distributional Reinforcement Learning	2018-06-14	Code
18	Prior noop	57608	No	Prioritized Experience Replay	2015-11-18	Code
19	IMPALA (deep)	57121	No	IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures	2018-02-05	Code
20	A2C + SIL	57071.7	No	Self-Imitation Learning	2018-06-14	Code
21	ASL DDQN	56520	No	Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity	2023-05-07	Code
22	C51 noop	55839	No	A Distributional Perspective on Reinforcement Learning	2017-07-21	Code
23	Prior+Duel hs	54630	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
24	Advantage Learning	52351.23	No	Increasing the Action Gap: New Operators for Reinforcement Learning	2015-12-15	Code
25	Prior hs	52264	No	Prioritized Experience Replay	2015-11-18	Code
26	Bootstrapped DQN	51500	No	Deep Exploration via Bootstrapped DQN	2016-02-15	Code
27	DDQN+Pop-Art noop	47770	No	Learning values across many orders of magnitude	2016-02-24	-
28	POP3D	44679.67	No	Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization	2018-07-02	Code
29	DDQN (tuned) noop	44127	No	Dueling Network Architectures for Deep Reinforcement Learning	2015-11-20	Code
30	DDQN (tuned) hs	43156	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
31	Gorila	43079.8	No	Massively Parallel Methods for Deep Reinforcement Learning	2015-07-15	Code
32	DQN noop	39544	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
33	UCT	38725	No	The Arcade Learning Environment: An Evaluation Platform for General Agents	2012-07-19	Code
34	DQN hs	35215	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
35	A3C FF hs	34216	No	Asynchronous Methods for Deep Reinforcement Learning	2016-02-04	Code
36	A3C FF (1 day) hs	31769	No	Asynchronous Methods for Deep Reinforcement Learning	2016-02-04	Code
37	Nature DQN	18257	No	-	-	Code
38	ES FF (1 hour) noop	16590	No	Evolution Strategies as a Scalable Alternative to Reinforcement Learning	2017-03-10	Code
39	Rainbow+SEER	11794	No	Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings	2021-03-04	Code
40	CGP	8960	No	Evolving simple programs for playing Atari games	2018-06-14	Code
41	CURL	6786.7	No	CURL: Contrastive Unsupervised Representations for Reinforcement Learning	2020-04-08	Code
42	SAC	305.3	No	Soft Actor-Critic for Discrete Action Settings	2019-10-16	Code
43	SARSA	89.1	No	-	-	-
44	Best Learner	67.7	No	The Arcade Learning Environment: An Evaluation Platform for General Agents	2012-07-19	Code
