Video Games on Atari 2600 Krull

Metric: Score (higher is better)

LeaderboardDataset

Loading chart...

Results

Sort:

#	Model↕	Score▼	Extra Data	Paper	Date↕	Code
1	GDI-H3	594540	No	Generalized Data Distribution Iteration	2022-06-07	-
2	MuZero	269358.27	No	Mastering Atari, Go, Chess and Shogi by Planning...	2019-11-19	Code
3	Agent57	251997.31	No	Agent57: Outperforming the Atari Human Benchmark	2020-03-30	Code
4	R2D2	218448.1	No	-	-	Code
5	GDI-I3	97575	No	Generalized Data Distribution Iteration	2022-06-07	-
6	GDI-I3	97575	No	Generalized Data Distribution Iteration	2022-06-07	-
7	MuZero (Res2 Adam)	72570.5	No	Online and Offline Reinforcement Learning by Pla...	2021-04-13	Code
8	DreamerV2	50061	No	Mastering Atari with Discrete World Models	2020-10-05	Code
9	VPN	15930	No	Value Prediction Network	2017-07-11	Code
10	Ape-X	11741.4	No	Distributed Prioritized Experience Replay	2018-03-02	Code
11	Duel noop	11451.9	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
12	QR-DQN-1	11447	No	Distributional Reinforcement Learning with Quant...	2017-10-27	Code
13	DNA	10956	No	DNA: Proximal Policy Optimization with a Dual Ne...	2022-06-20	Code
14	NoisyNet-Dueling	10754	No	Noisy Networks for Exploration	2017-06-30	Code
15	IQN	10707	No	Implicit Quantile Networks for Distributional Re...	2018-06-14	Code
16	A2C + SIL	10614.6	No	Self-Imitation Learning	2018-06-14	Code
17	ASL DDQN	10422.5	No	Train a Real-world Local Path Planner in One Hou...	2023-05-07	Code
18	Prior+Duel noop	10374.4	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
19	DDQN+Pop-Art noop	9745.1	No	Learning values across many orders of magnitude	2016-02-24	-
20	C51 noop	9735	No	A Distributional Perspective on Reinforcement Le...	2017-07-21	Code
21	Prior noop	9728	No	Prioritized Experience Replay	2015-11-18	Code
22	Advantage Learning	9548.92	No	Increasing the Action Gap: New Operators for Rei...	2015-12-15	Code
23	CGP	9086.8	No	Evolving simple programs for playing Atari games	2018-06-14	Code
24	Persistent AL	8689.81	No	Increasing the Action Gap: New Operators for Rei...	2015-12-15	Code
25	ES FF (1 hour) noop	8647.2	No	Evolution Strategies as a Scalable Alternative t...	2017-03-10	Code
26	Bootstrapped DQN	8627.9	No	Deep Exploration via Bootstrapped DQN	2016-02-15	Code
27	DQN noop	8422.3	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
28	IMPALA (deep)	8147.4	No	IMPALA: Scalable Distributed Deep-RL with Import...	2018-02-05	Code
29	A3C FF (1 day) hs	8066.6	No	Asynchronous Methods for Deep Reinforcement Lear...	2016-02-04	Code
30	Duel hs	8051.6	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
31	DDQN (tuned) noop	7920.5	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
32	POP3D	7715.68	No	Policy Optimization With Penalized Point Probabi...	2018-07-02	Code
33	Prior+Duel hs	7658.6	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
34	Prior hs	6872.8	No	Prioritized Experience Replay	2015-11-18	Code
35	DDQN (tuned) hs	6796.1	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
36	Gorila	6363.1	No	Massively Parallel Methods for Deep Reinforcemen...	2015-07-15	Code
37	DQN hs	6206	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
38	A3C LSTM hs	5911.4	No	Asynchronous Methods for Deep Reinforcement Lear...	2016-02-04	Code
39	A3C FF hs	5560	No	Asynchronous Methods for Deep Reinforcement Lear...	2016-02-04	Code
40	UCT	5037	No	The Arcade Learning Environment: An Evaluation P...	2012-07-19	Code
41	CURL	3833.6	No	CURL: Contrastive Unsupervised Representations f...	2020-04-08	Code
42	Nature DQN	3805	No	-	-	Code
43	Best Learner	3371.5	No	The Arcade Learning Environment: An Evaluation P...	2012-07-19	Code
44	SARSA	3341	No	-	-	-
45	Rainbow+SEER	3277.5	No	Improving Computational Efficiency in Visual Rei...	2021-03-04	Code

#1GDI-H3SOTA
594540
Score· 2022-06-07
Generalized Data Distribution Iteration
#2MuZeroSOTA
269358.27
Score· 2019-11-19
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model Code
#3Agent57
251997.31
Score· 2020-03-30
Agent57: Outperforming the Atari Human Benchmark Code
#4R2D2
218448.1
Score
No paperCode
#5GDI-I3
97575
Score· 2022-06-07
Generalized Data Distribution Iteration
#6GDI-I3
97575
Score· 2022-06-07
Generalized Data Distribution Iteration
#7MuZero (Res2 Adam)
72570.5
Score· 2021-04-13
Online and Offline Reinforcement Learning by Planning with a Learned Model Code
#8DreamerV2
50061
Score· 2020-10-05
Mastering Atari with Discrete World Models Code
#9VPNSOTA
15930
Score· 2017-07-11
Value Prediction Network Code
#10Ape-X
11741.4
Score· 2018-03-02
Distributed Prioritized Experience Replay Code
#11Duel noopSOTA
11451.9
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#12QR-DQN-1
11447
Score· 2017-10-27
Distributional Reinforcement Learning with Quantile Regression Code
#13DNA
10956
Score· 2022-06-20
DNA: Proximal Policy Optimization with a Dual Network Architecture Code
#14NoisyNet-Dueling
10754
Score· 2017-06-30
Noisy Networks for Exploration Code
#15IQN
10707
Score· 2018-06-14
Implicit Quantile Networks for Distributional Reinforcement Learning Code
#16A2C + SIL
10614.6
Score· 2018-06-14
Self-Imitation Learning Code
#17ASL DDQN
10422.5
Score· 2023-05-07
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity Code
#18Prior+Duel noop
10374.4
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#19DDQN+Pop-Art noop
9745.1
Score· 2016-02-24
Learning values across many orders of magnitude
#20C51 noop
9735
Score· 2017-07-21
A Distributional Perspective on Reinforcement Learning Code
#21Prior noopSOTA
9728
Score· 2015-11-18
Prioritized Experience Replay Code
#22Advantage Learning
9548.92
Score· 2015-12-15
Increasing the Action Gap: New Operators for Reinforcement Learning Code
#23CGP
9086.8
Score· 2018-06-14
Evolving simple programs for playing Atari games Code
#24Persistent AL
8689.81
Score· 2015-12-15
Increasing the Action Gap: New Operators for Reinforcement Learning Code
#25ES FF (1 hour) noop
8647.2
Score· 2017-03-10
Evolution Strategies as a Scalable Alternative to Reinforcement Learning Code
#26Bootstrapped DQN
8627.9
Score· 2016-02-15
Deep Exploration via Bootstrapped DQN Code
#27DQN noopSOTA
8422.3
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#28IMPALA (deep)
8147.4
Score· 2018-02-05
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures Code
#29A3C FF (1 day) hs
8066.6
Score· 2016-02-04
Asynchronous Methods for Deep Reinforcement Learning Code
#30Duel hs
8051.6
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#31DDQN (tuned) noop
7920.5
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#32POP3D
7715.68
Score· 2018-07-02
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization Code
#33Prior+Duel hs
7658.6
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#34Prior hs
6872.8
Score· 2015-11-18
Prioritized Experience Replay Code
#35DDQN (tuned) hs
6796.1
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#36GorilaSOTA
6363.1
Score· 2015-07-15
Massively Parallel Methods for Deep Reinforcement Learning Code
#37DQN hs
6206
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#38A3C LSTM hs
5911.4
Score· 2016-02-04
Asynchronous Methods for Deep Reinforcement Learning Code
#39A3C FF hs
5560
Score· 2016-02-04
Asynchronous Methods for Deep Reinforcement Learning Code
#40UCTSOTA
5037
Score· 2012-07-19
The Arcade Learning Environment: An Evaluation Platform for General Agents Code
#41CURL
3833.6
Score· 2020-04-08
CURL: Contrastive Unsupervised Representations for Reinforcement Learning Code
#42Nature DQN
3805
Score
No paperCode
#43Best Learner
3371.5
Score· 2012-07-19
The Arcade Learning Environment: An Evaluation Platform for General Agents Code
#44SARSA
3341
Score
No paper
#45Rainbow+SEER
3277.5
Score· 2021-03-04
Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings Code