Video Games on Atari 2600 Crazy Climber

Metric: Score (higher is better)

LeaderboardDataset

Loading chart...

Results

Sort:

#	Model↕	Score▼	Extra Data	Paper	Date↕	Code
1	Agent57	565909.85	No	Agent57: Outperforming the Atari Human Benchmark	2020-03-30	Code
2	MuZero	458315.4	No	Mastering Atari, Go, Chess and Shogi by Planning...	2019-11-19	Code
3	R2D2	366690.7	No	-	-	Code
4	Ape-X	320426	No	Distributed Prioritized Experience Replay	2018-03-02	Code
5	GDI-H3	241170	No	Generalized Data Distribution Iteration	2022-06-07	-
6	Reactor 500M	236422	No	The Reactor: A fast and sample-efficient Actor-C...	2017-04-15	-
7	FQF	223470.6	No	Fully Parameterized Quantile Function for Distri...	2019-11-05	Code
8	GDI-I3	201000	No	Generalized Data Distribution Iteration	2022-06-07	-
9	GDI-I3	201000	No	Generalized Data Distribution Iteration	2022-06-07	-
10	C51 noop	179877	No	A Distributional Perspective on Reinforcement Le...	2017-07-21	Code
11	IQN	179082	No	Implicit Quantile Networks for Distributional Re...	2018-06-14	Code
12	NoisyNet-Dueling	171171	No	Noisy Networks for Exploration	2017-06-30	Code
13	ASL DDQN	166019	No	Train a Real-world Local Path Planner in One Hou...	2023-05-07	Code
14	Prior+Duel noop	162224	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
15	DreamerV2	161839	No	Mastering Atari with Discrete World Models	2020-10-05	Code
16	QR-DQN-1	161196	No	Distributional Reinforcement Learning with Quant...	2017-10-27	Code
17	MuZero (Res2 Adam)	158541.58	No	Online and Offline Reinforcement Learning by Pla...	2021-04-13	Code
18	Duel noop	143570	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
19	Prior noop	141161	No	Prioritized Experience Replay	2015-11-18	Code
20	A3C LSTM hs	138518	No	Asynchronous Methods for Deep Reinforcement Lear...	2016-02-04	Code
21	Bootstrapped DQN	137925.9	No	Deep Exploration via Bootstrapped DQN	2016-02-15	Code
22	IMPALA (deep)	136950	No	IMPALA: Scalable Distributed Deep-RL with Import...	2018-02-05	Code
23	DNA	131623	No	DNA: Proximal Policy Optimization with a Dual Ne...	2022-06-20	Code
24	A2C + SIL	130185.8	No	Self-Imitation Learning	2018-06-14	Code
25	Persistent AL	130002.71	No	Increasing the Action Gap: New Operators for Rei...	2015-12-15	Code
26	Prior+Duel hs	127853	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
27	Prior hs	127512	No	Prioritized Experience Replay	2015-11-18	Code
28	Duel hs	124566	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
29	Advantage Learning	123410.71	No	Increasing the Action Gap: New Operators for Rei...	2015-12-15	Code
30	POP3D	120247.33	No	Policy Optimization With Penalized Point Probabi...	2018-07-02	Code
31	DDQN+Pop-Art noop	119679	No	Learning values across many orders of magnitude	2016-02-24	-
32	DDQN (tuned) noop	117282	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
33	Nature DQN	114103	No	-	-	Code
34	DDQN (tuned) hs	113782	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
35	A3C FF hs	112646	No	Asynchronous Methods for Deep Reinforcement Lear...	2016-02-04	Code
36	DQN noop	110763	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
37	A3C FF (1 day) hs	101624	No	Asynchronous Methods for Deep Reinforcement Lear...	2016-02-04	Code
38	UCT	98172.2	No	The Arcade Learning Environment: An Evaluation P...	2012-07-19	Code
39	DQN hs	98128	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
40	Gorila	65451	No	Massively Parallel Methods for Deep Reinforcemen...	2015-07-15	Code
41	Discrete Latent Space World Model (VQ-VAE)	59609.4	No	Smaller World Models for Reinforcement Learning	2020-10-12	-
42	VPN	54119	No	Value Prediction Network	2017-07-11	Code
43	Rainbow+SEER	28066	No	Improving Computational Efficiency in Visual Rei...	2021-03-04	Code
44	CURL	27805.6	No	CURL: Contrastive Unsupervised Representations f...	2020-04-08	Code
45	ES FF (1 hour) noop	26430	No	Evolution Strategies as a Scalable Alternative t...	2017-03-10	Code
46	Best Learner	23410.6	No	The Arcade Learning Environment: An Evaluation P...	2012-07-19	Code
47	CGP	12900	No	Evolving simple programs for playing Atari games	2018-06-14	Code
48	SAC	3668.7	No	Soft Actor-Critic for Discrete Action Settings	2019-10-16	Code
49	SARSA	149.8	No	-	-	-

#1Agent57SOTA
565909.85
Score· 2020-03-30
Agent57: Outperforming the Atari Human Benchmark Code
#2MuZeroSOTA
458315.4
Score· 2019-11-19
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model Code
#3R2D2
366690.7
Score
No paperCode
#4Ape-XSOTA
320426
Score· 2018-03-02
Distributed Prioritized Experience Replay Code
#5GDI-H3
241170
Score· 2022-06-07
Generalized Data Distribution Iteration
#6Reactor 500MSOTA
236422
Score· 2017-04-15
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
#7FQF
223470.6
Score· 2019-11-05
Fully Parameterized Quantile Function for Distributional Reinforcement Learning Code
#8GDI-I3
201000
Score· 2022-06-07
Generalized Data Distribution Iteration
#9GDI-I3
201000
Score· 2022-06-07
Generalized Data Distribution Iteration
#10C51 noop
179877
Score· 2017-07-21
A Distributional Perspective on Reinforcement Learning Code
#11IQN
179082
Score· 2018-06-14
Implicit Quantile Networks for Distributional Reinforcement Learning Code
#12NoisyNet-Dueling
171171
Score· 2017-06-30
Noisy Networks for Exploration Code
#13ASL DDQN
166019
Score· 2023-05-07
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity Code
#14Prior+Duel noopSOTA
162224
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#15DreamerV2
161839
Score· 2020-10-05
Mastering Atari with Discrete World Models Code
#16QR-DQN-1
161196
Score· 2017-10-27
Distributional Reinforcement Learning with Quantile Regression Code
#17MuZero (Res2 Adam)
158541.58
Score· 2021-04-13
Online and Offline Reinforcement Learning by Planning with a Learned Model Code
#18Duel noop
143570
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#19Prior noopSOTA
141161
Score· 2015-11-18
Prioritized Experience Replay Code
#20A3C LSTM hs
138518
Score· 2016-02-04
Asynchronous Methods for Deep Reinforcement Learning Code
#21Bootstrapped DQN
137925.9
Score· 2016-02-15
Deep Exploration via Bootstrapped DQN Code
#22IMPALA (deep)
136950
Score· 2018-02-05
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures Code
#23DNA
131623
Score· 2022-06-20
DNA: Proximal Policy Optimization with a Dual Network Architecture Code
#24A2C + SIL
130185.8
Score· 2018-06-14
Self-Imitation Learning Code
#25Persistent AL
130002.71
Score· 2015-12-15
Increasing the Action Gap: New Operators for Reinforcement Learning Code
#26Prior+Duel hsSOTA
127853
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#27Prior hs
127512
Score· 2015-11-18
Prioritized Experience Replay Code
#28Duel hs
124566
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#29Advantage Learning
123410.71
Score· 2015-12-15
Increasing the Action Gap: New Operators for Reinforcement Learning Code
#30POP3D
120247.33
Score· 2018-07-02
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization Code
#31DDQN+Pop-Art noop
119679
Score· 2016-02-24
Learning values across many orders of magnitude
#32DDQN (tuned) noop
117282
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#33Nature DQN
114103
Score
No paperCode
#34DDQN (tuned) hs
113782
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#35A3C FF hs
112646
Score· 2016-02-04
Asynchronous Methods for Deep Reinforcement Learning Code
#36DQN noop
110763
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#37A3C FF (1 day) hs
101624
Score· 2016-02-04
Asynchronous Methods for Deep Reinforcement Learning Code
#38UCTSOTA
98172.2
Score· 2012-07-19
The Arcade Learning Environment: An Evaluation Platform for General Agents Code
#39DQN hs
98128
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#40Gorila
65451
Score· 2015-07-15
Massively Parallel Methods for Deep Reinforcement Learning Code
#41Discrete Latent Space World Model (VQ-VAE)
59609.4
Score· 2020-10-12
Smaller World Models for Reinforcement Learning
#42VPN
54119
Score· 2017-07-11
Value Prediction Network Code
#43Rainbow+SEER
28066
Score· 2021-03-04
Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings Code
#44CURL
27805.6
Score· 2020-04-08
CURL: Contrastive Unsupervised Representations for Reinforcement Learning Code
#45ES FF (1 hour) noop
26430
Score· 2017-03-10
Evolution Strategies as a Scalable Alternative to Reinforcement Learning Code
#46Best Learner
23410.6
Score· 2012-07-19
The Arcade Learning Environment: An Evaluation Platform for General Agents Code
#47CGP
12900
Score· 2018-06-14
Evolving simple programs for playing Atari games Code
#48SAC
3668.7
Score· 2019-10-16
Soft Actor-Critic for Discrete Action Settings Code
#49SARSA
149.8
Score
No paper