Video Games on Atari 2600 Assault

Metric: Score (higher is better)

LeaderboardDataset

Loading chart...

Results

Sort:

#	Model↕	Score▼	Extra Data	Paper	Date↕	Code
1	MuZero	143972.03	No	Mastering Atari, Go, Chess and Shogi by Planning...	2019-11-19	Code
2	R2D2	108197	No	-	-	Code
3	GDI-H3	97155	No	Generalized Data Distribution Iteration	2022-06-07	-
4	Agent57	67212.67	No	Agent57: Outperforming the Atari Human Benchmark	2020-03-30	Code
5	GDI-I3	63876	No	Generalized Data Distribution Iteration	2022-06-07	-
6	MuZero (Res2 Adam)	33292.22	No	Online and Offline Reinforcement Learning by Pla...	2021-04-13	Code
7	IQN	29091	No	Implicit Quantile Networks for Distributional Re...	2018-06-14	Code
8	Ape-X	24559.4	No	Distributed Prioritized Experience Replay	2018-03-02	Code
9	DreamerV2	23625	No	Mastering Atari with Discrete World Models	2020-10-05	Code
10	QR-DQN-1	22012	No	Distributional Reinforcement Learning with Quant...	2017-10-27	Code
11	IMPALA (deep)	19148.47	No	IMPALA: Scalable Distributed Deep-RL with Import...	2018-02-05	Code
12	DNA	16293	No	DNA: Proximal Policy Optimization with a Dual Ne...	2022-06-20	Code
13	A3C LSTM hs	14497.9	No	Asynchronous Methods for Deep Reinforcement Lear...	2016-02-04	Code
14	ASL DDQN	14372.8	No	Train a Real-world Local Path Planner in One Hou...	2023-05-07	Code
15	Prior+Duel noop	11477	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
16	NoisyNet-Dueling	11231	No	Noisy Networks for Exploration	2017-06-30	Code
17	Prior+Duel hs	10950.6	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
18	Prior+Duel hs	10950.6	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
19	DDQN+Pop-Art noop	9011.6	No	Learning values across many orders of magnitude	2016-02-24	-
20	Reactor 500M	8323.3	No	The Reactor: A fast and sample-efficient Actor-C...	2017-04-15	-
21	Bootstrapped DQN	8047.1	No	Deep Exploration via Bootstrapped DQN	2016-02-15	Code
22	Prior noop	7672.1	No	Prioritized Experience Replay	2015-11-18	Code
23	C51 noop	7203	No	A Distributional Perspective on Reinforcement Le...	2017-07-21	Code
24	Prior hs	6548.9	No	Prioritized Experience Replay	2015-11-18	Code
25	DDQN (tuned) hs	6060.8	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
26	A3C FF hs	5474.9	No	Asynchronous Methods for Deep Reinforcement Lear...	2016-02-04	Code
27	POP3D	5400.13	No	Policy Optimization With Penalized Point Probabi...	2018-07-02	Code
28	DDQN (tuned) noop	5393.2	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
29	Duel noop	4621	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
30	DQN noop	4280.4	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
31	Duel hs	3994.8	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
32	A3C FF (1 day) hs	3746.1	No	Asynchronous Methods for Deep Reinforcement Lear...	2016-02-04	Code
33	Advantage Learning	3661.51	No	Increasing the Action Gap: New Operators for Rei...	2015-12-15	Code
34	DQN hs	3489.3	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
35	Nature DQN	3359	No	-	-	Code
36	Persistent AL	3304.33	No	Increasing the Action Gap: New Operators for Rei...	2015-12-15	Code
37	A2C + SIL	1812	No	Self-Imitation Learning	2018-06-14	Code
38	ES FF (1 hour) noop	1673.9	No	Evolution Strategies as a Scalable Alternative t...	2017-03-10	Code
39	UCT	1512.2	No	The Arcade Learning Environment: An Evaluation P...	2012-07-19	Code
40	Gorila	1195.8	No	Massively Parallel Methods for Deep Reinforcemen...	2015-07-15	Code
41	CGP	890.4	No	Evolving simple programs for playing Atari games	2018-06-14	Code
42	Best Learner	628	No	The Arcade Learning Environment: An Evaluation P...	2012-07-19	Code
43	CURL	543.7	No	CURL: Contrastive Unsupervised Representations f...	2020-04-08	Code
44	SARSA	537	No	-	-	-
45	SAC	350	No	Soft Actor-Critic for Discrete Action Settings	2019-10-16	Code

#1MuZeroSOTA
143972.03
Score· 2019-11-19
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model Code
#2R2D2
108197
Score
No paperCode
#3GDI-H3
97155
Score· 2022-06-07
Generalized Data Distribution Iteration
#4Agent57
67212.67
Score· 2020-03-30
Agent57: Outperforming the Atari Human Benchmark Code
#5GDI-I3
63876
Score· 2022-06-07
Generalized Data Distribution Iteration
#6MuZero (Res2 Adam)
33292.22
Score· 2021-04-13
Online and Offline Reinforcement Learning by Planning with a Learned Model Code
#7IQNSOTA
29091
Score· 2018-06-14
Implicit Quantile Networks for Distributional Reinforcement Learning Code
#8Ape-XSOTA
24559.4
Score· 2018-03-02
Distributed Prioritized Experience Replay Code
#9DreamerV2
23625
Score· 2020-10-05
Mastering Atari with Discrete World Models Code
#10QR-DQN-1SOTA
22012
Score· 2017-10-27
Distributional Reinforcement Learning with Quantile Regression Code
#11IMPALA (deep)
19148.47
Score· 2018-02-05
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures Code
#12DNA
16293
Score· 2022-06-20
DNA: Proximal Policy Optimization with a Dual Network Architecture Code
#13A3C LSTM hsSOTA
14497.9
Score· 2016-02-04
Asynchronous Methods for Deep Reinforcement Learning Code
#14ASL DDQN
14372.8
Score· 2023-05-07
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity Code
#15Prior+Duel noopSOTA
11477
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#16NoisyNet-Dueling
11231
Score· 2017-06-30
Noisy Networks for Exploration Code
#17Prior+Duel hs
10950.6
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#18Prior+Duel hs
10950.6
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#19DDQN+Pop-Art noop
9011.6
Score· 2016-02-24
Learning values across many orders of magnitude
#20Reactor 500M
8323.3
Score· 2017-04-15
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
#21Bootstrapped DQN
8047.1
Score· 2016-02-15
Deep Exploration via Bootstrapped DQN Code
#22Prior noopSOTA
7672.1
Score· 2015-11-18
Prioritized Experience Replay Code
#23C51 noop
7203
Score· 2017-07-21
A Distributional Perspective on Reinforcement Learning Code
#24Prior hs
6548.9
Score· 2015-11-18
Prioritized Experience Replay Code
#25DDQN (tuned) hsSOTA
6060.8
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#26A3C FF hs
5474.9
Score· 2016-02-04
Asynchronous Methods for Deep Reinforcement Learning Code
#27POP3D
5400.13
Score· 2018-07-02
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization Code
#28DDQN (tuned) noop
5393.2
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#29Duel noop
4621
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#30DQN noop
4280.4
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#31Duel hs
3994.8
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#32A3C FF (1 day) hs
3746.1
Score· 2016-02-04
Asynchronous Methods for Deep Reinforcement Learning Code
#33Advantage Learning
3661.51
Score· 2015-12-15
Increasing the Action Gap: New Operators for Reinforcement Learning Code
#34DQN hs
3489.3
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#35Nature DQN
3359
Score
No paperCode
#36Persistent AL
3304.33
Score· 2015-12-15
Increasing the Action Gap: New Operators for Reinforcement Learning Code
#37A2C + SIL
1812
Score· 2018-06-14
Self-Imitation Learning Code
#38ES FF (1 hour) noop
1673.9
Score· 2017-03-10
Evolution Strategies as a Scalable Alternative to Reinforcement Learning Code
#39UCTSOTA
1512.2
Score· 2012-07-19
The Arcade Learning Environment: An Evaluation Platform for General Agents Code
#40Gorila
1195.8
Score· 2015-07-15
Massively Parallel Methods for Deep Reinforcement Learning Code
#41CGP
890.4
Score· 2018-06-14
Evolving simple programs for playing Atari games Code
#42Best Learner
628
Score· 2012-07-19
The Arcade Learning Environment: An Evaluation Platform for General Agents Code
#43CURL
543.7
Score· 2020-04-08
CURL: Contrastive Unsupervised Representations for Reinforcement Learning Code
#44SARSA
537
Score
No paper
#45SAC
350
Score· 2019-10-16
Soft Actor-Critic for Discrete Action Settings Code