Atari Games on Atari 2600 Zaxxon

Metric: Score (higher is better)

LeaderboardDataset

Loading chart...

Results

Sort:

#	Model↕	Score▼	Extra Data	Paper	Date↕	Code
1	MuZero	725853.9	No	Mastering Atari, Go, Chess and Shogi by Planning...	2019-11-19	Code
2	Agent57	249808.9	No	Agent57: Outperforming the Atari Human Benchmark	2020-03-30	Code
3	R2D2	224910.7	No	-	-	Code
4	GDI-H3	216020	No	Generalized Data Distribution Iteration	2022-06-07	-
5	MuZero (Res2 Adam)	154131.86	No	Online and Offline Reinforcement Learning by Pla...	2021-04-13	Code
6	GDI-I3	109140	No	Generalized Data Distribution Iteration	2022-06-07	-
7	DreamerV2	50699	No	Mastering Atari with Discrete World Models	2020-10-05	Code
8	Ape-X	42285.5	No	Distributed Prioritized Experience Replay	2018-03-02	Code
9	IMPALA (deep)	32935.5	No	IMPALA: Scalable Distributed Deep-RL with Import...	2018-02-05	Code
10	A3C FF hs	24622	No	Asynchronous Methods for Deep Reinforcement Lear...	2016-02-04	Code
11	A3C LSTM hs	23519	No	Asynchronous Methods for Deep Reinforcement Lear...	2016-02-04	Code
12	UCT	22610	No	The Arcade Learning Environment: An Evaluation P...	2012-07-19	Code
13	DNA	22588	No	DNA: Proximal Policy Optimization with a Dual Ne...	2022-06-20	Code
14	IQN	21772	No	Implicit Quantile Networks for Distributional Re...	2018-06-14	Code
15	ASL DDQN	16420	No	Train a Real-world Local Path Planner in One Hou...	2023-05-07	Code
16	RIMs-PPO	15000	No	Recurrent Independent Mechanisms	2019-09-24	Code
17	NoisyNet-Dueling	14874	No	Noisy Networks for Exploration	2017-06-30	Code
18	DDQN+Pop-Art noop	14402	No	Learning values across many orders of magnitude	2016-02-24	-
19	Prior+Duel noop	13886	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
20	QR-DQN-1	13112	No	Distributional Reinforcement Learning with Quant...	2017-10-27	Code
21	Duel noop	12944	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
22	Bootstrapped DQN	11491.7	No	Deep Exploration via Bootstrapped DQN	2016-02-15	Code
23	Prior+Duel hs	11320	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
24	C51 noop	10513	No	A Distributional Perspective on Reinforcement Le...	2017-07-21	Code
25	Prior noop	10469	No	Prioritized Experience Replay	2015-11-18	Code
26	Duel hs	10164	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
27	DDQN (tuned) noop	10163	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
28	Prior hs	9474	No	Prioritized Experience Replay	2015-11-18	Code
29	POP3D	9472	No	Policy Optimization With Penalized Point Probabi...	2018-07-02	Code
30	A2C + SIL	9164.2	No	Self-Imitation Learning	2018-06-14	Code
31	Advantage Learning	9129.61	No	Increasing the Action Gap: New Operators for Rei...	2015-12-15	Code
32	DDQN (tuned) hs	8593	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
33	ES FF (1 hour) noop	6380	No	Evolution Strategies as a Scalable Alternative t...	2017-03-10	Code
34	Gorila	6159.4	No	Massively Parallel Methods for Deep Reinforcemen...	2015-07-15	Code
35	DQN noop	5363	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
36	Nature DQN	4977	No	-	-	Code
37	DQN hs	4412	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
38	Best Learner	3365.1	No	The Arcade Learning Environment: An Evaluation P...	2012-07-19	Code
39	CGP	2980	No	Evolving simple programs for playing Atari games	2018-06-14	Code
40	A3C FF (1 day) hs	2659	No	Asynchronous Methods for Deep Reinforcement Lear...	2016-02-04	Code
41	SARSA	21.4	No	-	-	-

#1MuZeroSOTA
725853.9
Score· 2019-11-19
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model Code
#2Agent57
249808.9
Score· 2020-03-30
Agent57: Outperforming the Atari Human Benchmark Code
#3R2D2
224910.7
Score
No paperCode
#4GDI-H3
216020
Score· 2022-06-07
Generalized Data Distribution Iteration
#5MuZero (Res2 Adam)
154131.86
Score· 2021-04-13
Online and Offline Reinforcement Learning by Planning with a Learned Model Code
#6GDI-I3
109140
Score· 2022-06-07
Generalized Data Distribution Iteration
#7DreamerV2
50699
Score· 2020-10-05
Mastering Atari with Discrete World Models Code
#8Ape-XSOTA
42285.5
Score· 2018-03-02
Distributed Prioritized Experience Replay Code
#9IMPALA (deep)SOTA
32935.5
Score· 2018-02-05
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures Code
#10A3C FF hsSOTA
24622
Score· 2016-02-04
Asynchronous Methods for Deep Reinforcement Learning Code
#11A3C LSTM hs
23519
Score· 2016-02-04
Asynchronous Methods for Deep Reinforcement Learning Code
#12UCTSOTA
22610
Score· 2012-07-19
The Arcade Learning Environment: An Evaluation Platform for General Agents Code
#13DNA
22588
Score· 2022-06-20
DNA: Proximal Policy Optimization with a Dual Network Architecture Code
#14IQN
21772
Score· 2018-06-14
Implicit Quantile Networks for Distributional Reinforcement Learning Code
#15ASL DDQN
16420
Score· 2023-05-07
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity Code
#16RIMs-PPO
15000
Score· 2019-09-24
Recurrent Independent Mechanisms Code
#17NoisyNet-Dueling
14874
Score· 2017-06-30
Noisy Networks for Exploration Code
#18DDQN+Pop-Art noop
14402
Score· 2016-02-24
Learning values across many orders of magnitude
#19Prior+Duel noop
13886
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#20QR-DQN-1
13112
Score· 2017-10-27
Distributional Reinforcement Learning with Quantile Regression Code
#21Duel noop
12944
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#22Bootstrapped DQN
11491.7
Score· 2016-02-15
Deep Exploration via Bootstrapped DQN Code
#23Prior+Duel hs
11320
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#24C51 noop
10513
Score· 2017-07-21
A Distributional Perspective on Reinforcement Learning Code
#25Prior noop
10469
Score· 2015-11-18
Prioritized Experience Replay Code
#26Duel hs
10164
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#27DDQN (tuned) noop
10163
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#28Prior hs
9474
Score· 2015-11-18
Prioritized Experience Replay Code
#29POP3D
9472
Score· 2018-07-02
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization Code
#30A2C + SIL
9164.2
Score· 2018-06-14
Self-Imitation Learning Code
#31Advantage Learning
9129.61
Score· 2015-12-15
Increasing the Action Gap: New Operators for Reinforcement Learning Code
#32DDQN (tuned) hs
8593
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#33ES FF (1 hour) noop
6380
Score· 2017-03-10
Evolution Strategies as a Scalable Alternative to Reinforcement Learning Code
#34Gorila
6159.4
Score· 2015-07-15
Massively Parallel Methods for Deep Reinforcement Learning Code
#35DQN noop
5363
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#36Nature DQN
4977
Score
No paperCode
#37DQN hs
4412
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#38Best Learner
3365.1
Score· 2012-07-19
The Arcade Learning Environment: An Evaluation Platform for General Agents Code
#39CGP
2980
Score· 2018-06-14
Evolving simple programs for playing Atari games Code
#40A3C FF (1 day) hs
2659
Score· 2016-02-04
Asynchronous Methods for Deep Reinforcement Learning Code
#41SARSA
21.4
Score
No paper