Atari Games on Atari 2600 Beam Rider

Metric: Score (higher is better)

LeaderboardDataset

Loading chart...

Results

Sort:

#	Model↕	Score▼	Extra Data	Paper	Date↕	Code
1	MuZero	454993.53	No	Mastering Atari, Go, Chess and Shogi by Planning...	2019-11-19	Code
2	GDI-H3	422890	No	Generalized Data Distribution Iteration	2022-06-07	-
3	MuZero (Res2 Adam)	333077.44	No	Online and Offline Reinforcement Learning by Pla...	2021-04-13	Code
4	Agent57	300509.8	No	Agent57: Outperforming the Atari Human Benchmark	2020-03-30	Code
5	R2D2	188257.4	No	-	-	Code
6	GDI-I3	162100	No	Generalized Data Distribution Iteration	2022-06-07	-
7	GDI-I3	162100	No	Generalized Data Distribution Iteration	2022-06-07	-
8	Ape-X	63305.2	No	Distributed Prioritized Experience Replay	2018-03-02	Code
9	IQN	42776	No	Implicit Quantile Networks for Distributional Re...	2018-06-14	Code
10	Prior+Duel hs	37412.2	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
11	QR-DQN-1	34821	No	Distributional Reinforcement Learning with Quant...	2017-10-27	Code
12	IMPALA (deep)	32463.47	No	IMPALA: Scalable Distributed Deep-RL with Import...	2018-02-05	Code
13	Prior hs	31181.3	No	Prioritized Experience Replay	2015-11-18	Code
14	Prior+Duel noop	30276.5	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
15	ASL DDQN	26841.6	No	Train a Real-world Local Path Planner in One Hou...	2023-05-07	Code
16	A3C LSTM hs	24622.2	No	Asynchronous Methods for Deep Reinforcement Lear...	2016-02-04	Code
17	Bootstrapped DQN	23429.8	No	Deep Exploration via Bootstrapped DQN	2016-02-15	Code
18	Prior noop	23384.2	No	Prioritized Experience Replay	2015-11-18	Code
19	NoisyNet-Dueling	23134	No	Noisy Networks for Exploration	2017-06-30	Code
20	A3C FF hs	22707.9	No	Asynchronous Methods for Deep Reinforcement Lear...	2016-02-04	Code
21	DNA	20393	No	DNA: Proximal Policy Optimization with a Dual Ne...	2022-06-20	Code
22	DreamerV2	18646	No	Mastering Atari with Discrete World Models	2020-10-05	Code
23	DDQN (tuned) hs	17417.2	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
24	DDRL A3C	14900	No	Distributed Deep Reinforcement Learning: Learn h...	2018-01-09	Code
25	Duel hs	14591.3	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
26	C51 noop	14074	No	A Distributional Perspective on Reinforcement Le...	2017-07-21	Code
27	DDQN (tuned) noop	13772.8	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
28	A3C FF (1 day) hs	13235.9	No	Asynchronous Methods for Deep Reinforcement Lear...	2016-02-04	Code
29	Persistent AL	13145.34	No	Increasing the Action Gap: New Operators for Rei...	2015-12-15	Code
30	Duel noop	12164	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
31	Reactor 500M	11033.4	No	The Reactor: A fast and sample-efficient Actor-C...	2017-04-15	-
32	Advantage Learning	10054.58	No	Increasing the Action Gap: New Operators for Rei...	2015-12-15	Code
33	DQN hs	9743.2	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
34	DQN noop	8627.5	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
35	DDQN+Pop-Art noop	8299.4	No	Learning values across many orders of magnitude	2016-02-24	-
36	Nature DQN	6846	No	-	-	Code
37	UCT	6624.6	No	The Arcade Learning Environment: An Evaluation P...	2012-07-19	Code
38	MAC	6072	No	Mean Actor Critic	2017-09-01	Code
39	RIMs-PPO	5320	No	Recurrent Independent Mechanisms	2019-09-24	Code
40	DQN Best	5184	No	Playing Atari with Deep Reinforcement Learning	2013-12-19	Code
41	POP3D	4549	No	Policy Optimization With Penalized Point Probabi...	2018-07-02	Code
42	Gorila	3822.1	No	Massively Parallel Methods for Deep Reinforcemen...	2015-07-15	Code
43	A2C + SIL	2366.2	No	Self-Imitation Learning	2018-06-14	Code
44	SARSA	1743	No	-	-	-
45	CGP	1341.6	No	Evolving simple programs for playing Atari games	2018-06-14	Code
46	Best Learner	929.4	No	The Arcade Learning Environment: An Evaluation P...	2012-07-19	Code
47	ES FF (1 hour) noop	744	No	Evolution Strategies as a Scalable Alternative t...	2017-03-10	Code
48	SAC	432.1	No	Soft Actor-Critic for Discrete Action Settings	2019-10-16	Code

#1MuZeroSOTA
454993.53
Score· 2019-11-19
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model Code
#2GDI-H3
422890
Score· 2022-06-07
Generalized Data Distribution Iteration
#3MuZero (Res2 Adam)
333077.44
Score· 2021-04-13
Online and Offline Reinforcement Learning by Planning with a Learned Model Code
#4Agent57
300509.8
Score· 2020-03-30
Agent57: Outperforming the Atari Human Benchmark Code
#5R2D2
188257.4
Score
No paperCode
#6GDI-I3
162100
Score· 2022-06-07
Generalized Data Distribution Iteration
#7GDI-I3
162100
Score· 2022-06-07
Generalized Data Distribution Iteration
#8Ape-XSOTA
63305.2
Score· 2018-03-02
Distributed Prioritized Experience Replay Code
#9IQN
42776
Score· 2018-06-14
Implicit Quantile Networks for Distributional Reinforcement Learning Code
#10Prior+Duel hsSOTA
37412.2
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#11QR-DQN-1
34821
Score· 2017-10-27
Distributional Reinforcement Learning with Quantile Regression Code
#12IMPALA (deep)
32463.47
Score· 2018-02-05
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures Code
#13Prior hs
31181.3
Score· 2015-11-18
Prioritized Experience Replay Code
#14Prior+Duel noop
30276.5
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#15ASL DDQN
26841.6
Score· 2023-05-07
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity Code
#16A3C LSTM hs
24622.2
Score· 2016-02-04
Asynchronous Methods for Deep Reinforcement Learning Code
#17Bootstrapped DQN
23429.8
Score· 2016-02-15
Deep Exploration via Bootstrapped DQN Code
#18Prior noop
23384.2
Score· 2015-11-18
Prioritized Experience Replay Code
#19NoisyNet-Dueling
23134
Score· 2017-06-30
Noisy Networks for Exploration Code
#20A3C FF hs
22707.9
Score· 2016-02-04
Asynchronous Methods for Deep Reinforcement Learning Code
#21DNA
20393
Score· 2022-06-20
DNA: Proximal Policy Optimization with a Dual Network Architecture Code
#22DreamerV2
18646
Score· 2020-10-05
Mastering Atari with Discrete World Models Code
#23DDQN (tuned) hs
17417.2
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#24DDRL A3C
14900
Score· 2018-01-09
Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes Code
#25Duel hs
14591.3
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#26C51 noop
14074
Score· 2017-07-21
A Distributional Perspective on Reinforcement Learning Code
#27DDQN (tuned) noop
13772.8
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#28A3C FF (1 day) hs
13235.9
Score· 2016-02-04
Asynchronous Methods for Deep Reinforcement Learning Code
#29Persistent AL
13145.34
Score· 2015-12-15
Increasing the Action Gap: New Operators for Reinforcement Learning Code
#30Duel noop
12164
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#31Reactor 500M
11033.4
Score· 2017-04-15
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
#32Advantage Learning
10054.58
Score· 2015-12-15
Increasing the Action Gap: New Operators for Reinforcement Learning Code
#33DQN hs
9743.2
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#34DQN noop
8627.5
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#35DDQN+Pop-Art noop
8299.4
Score· 2016-02-24
Learning values across many orders of magnitude
#36Nature DQN
6846
Score
No paperCode
#37UCTSOTA
6624.6
Score· 2012-07-19
The Arcade Learning Environment: An Evaluation Platform for General Agents Code
#38MAC
6072
Score· 2017-09-01
Mean Actor Critic Code
#39RIMs-PPO
5320
Score· 2019-09-24
Recurrent Independent Mechanisms Code
#40DQN Best
5184
Score· 2013-12-19
Playing Atari with Deep Reinforcement Learning Code
#41POP3D
4549
Score· 2018-07-02
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization Code
#42Gorila
3822.1
Score· 2015-07-15
Massively Parallel Methods for Deep Reinforcement Learning Code
#43A2C + SIL
2366.2
Score· 2018-06-14
Self-Imitation Learning Code
#44SARSA
1743
Score
No paper
#45CGP
1341.6
Score· 2018-06-14
Evolving simple programs for playing Atari games Code
#46Best Learner
929.4
Score· 2012-07-19
The Arcade Learning Environment: An Evaluation Platform for General Agents Code
#47ES FF (1 hour) noop
744
Score· 2017-03-10
Evolution Strategies as a Scalable Alternative to Reinforcement Learning Code
#48SAC
432.1
Score· 2019-10-16
Soft Actor-Critic for Discrete Action Settings Code