Atari Games on Atari 2600 Asteroids

Metric: Score (higher is better)

LeaderboardDataset

Loading chart...

Results

Sort:

#	Model↕	Score▼	Extra Data	Paper	Date↕	Code
1	GDI-H3	760005	No	Generalized Data Distribution Iteration	2022-06-07	-
2	GDI-I3	751970	No	Generalized Data Distribution Iteration	2022-06-07	-
3	MuZero	678558.64	No	Mastering Atari, Go, Chess and Shogi by Planning...	2019-11-19	Code
4	MuZero (Res2 Adam)	476412	No	Online and Offline Reinforcement Learning by Pla...	2021-04-13	Code
5	R2D2	357867.7	No	-	-	Code
6	DNA	165973	No	DNA: Proximal Policy Optimization with a Dual Ne...	2022-06-20	Code
7	Ape-X	155495.1	No	Distributed Prioritized Experience Replay	2018-03-02	Code
8	Agent57	150854.61	No	Agent57: Outperforming the Atari Human Benchmark	2020-03-30	Code
9	IMPALA (deep)	108590.05	No	IMPALA: Scalable Distributed Deep-RL with Import...	2018-02-05	Code
10	NoisyNet-Dueling	86700	No	Noisy Networks for Exploration	2017-06-30	Code
11	DreamerV2	41526	No	Mastering Atari with Discrete World Models	2020-10-05	Code
12	CGP	9412	No	Evolving simple programs for playing Atari games	2018-06-14	Code
13	A3C LSTM hs	5093.1	No	Asynchronous Methods for Deep Reinforcement Lear...	2016-02-04	Code
14	UCT	4660.6	No	The Arcade Learning Environment: An Evaluation P...	2012-07-19	Code
15	FQF	4553	No	Fully Parameterized Quantile Function for Distri...	2019-11-05	Code
16	A3C FF hs	4474.5	No	Asynchronous Methods for Deep Reinforcement Lear...	2016-02-04	Code
17	QR-DQN-1	4226	No	Distributional Reinforcement Learning with Quant...	2017-10-27	Code
18	Reactor 500M	3726.1	No	The Reactor: A fast and sample-efficient Actor-C...	2017-04-15	-
19	A3C FF (1 day) hs	3009.4	No	Asynchronous Methods for Deep Reinforcement Lear...	2016-02-04	Code
20	IQN	2898	No	Implicit Quantile Networks for Distributional Re...	2018-06-14	Code
21	DDQN+Pop-Art noop	2869.3	No	Learning values across many orders of magnitude	2016-02-24	-
22	Duel noop	2837.7	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
23	Prior noop	2654.3	No	Prioritized Experience Replay	2015-11-18	Code
24	POP3D	2488.1	No	Policy Optimization With Penalized Point Probabi...	2018-07-02	Code
25	A2C + SIL	2259.4	No	Self-Imitation Learning	2018-06-14	Code
26	Duel hs	2035.4	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
27	ASL DDQN	1984.5	No	Train a Real-world Local Path Planner in One Hou...	2023-05-07	Code
28	Advantage Learning	1924.42	No	Increasing the Action Gap: New Operators for Rei...	2015-12-15	Code
29	Prior hs	1745.1	No	Prioritized Experience Replay	2015-11-18	Code
30	Persistent AL	1673.52	No	Increasing the Action Gap: New Operators for Rei...	2015-12-15	Code
31	Nature DQN	1629	No	-	-	Code
32	ES FF (1 hour) noop	1562	No	Evolution Strategies as a Scalable Alternative t...	2017-03-10	Code
33	C51 noop	1516	No	A Distributional Perspective on Reinforcement Le...	2017-07-21	Code
34	DQN hs	1458.7	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
35	DQN noop	1364.5	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
36	DDQN (tuned) hs	1193.2	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
37	Prior+Duel noop	1192.7	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
38	Bootstrapped DQN	1032	No	Deep Exploration via Bootstrapped DQN	2016-02-15	Code
39	Prior+Duel hs	1021.9	No	Deep Reinforcement Learning with Double Q-learning	2015-09-22	Code
40	Gorila	933.6	No	Massively Parallel Methods for Deep Reinforcemen...	2015-07-15	Code
41	Best Learner	907.3	No	The Arcade Learning Environment: An Evaluation P...	2012-07-19	Code
42	DDQN (tuned) noop	734.7	No	Dueling Network Architectures for Deep Reinforce...	2015-11-20	Code
43	SARSA	89	No	-	-	-

#1GDI-H3SOTA
760005
Score· 2022-06-07
Generalized Data Distribution Iteration
#2GDI-I3
751970
Score· 2022-06-07
Generalized Data Distribution Iteration
#3MuZeroSOTA
678558.64
Score· 2019-11-19
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model Code
#4MuZero (Res2 Adam)
476412
Score· 2021-04-13
Online and Offline Reinforcement Learning by Planning with a Learned Model Code
#5R2D2
357867.7
Score
No paperCode
#6DNA
165973
Score· 2022-06-20
DNA: Proximal Policy Optimization with a Dual Network Architecture Code
#7Ape-XSOTA
155495.1
Score· 2018-03-02
Distributed Prioritized Experience Replay Code
#8Agent57
150854.61
Score· 2020-03-30
Agent57: Outperforming the Atari Human Benchmark Code
#9IMPALA (deep)SOTA
108590.05
Score· 2018-02-05
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures Code
#10NoisyNet-DuelingSOTA
86700
Score· 2017-06-30
Noisy Networks for Exploration Code
#11DreamerV2
41526
Score· 2020-10-05
Mastering Atari with Discrete World Models Code
#12CGP
9412
Score· 2018-06-14
Evolving simple programs for playing Atari games Code
#13A3C LSTM hsSOTA
5093.1
Score· 2016-02-04
Asynchronous Methods for Deep Reinforcement Learning Code
#14UCTSOTA
4660.6
Score· 2012-07-19
The Arcade Learning Environment: An Evaluation Platform for General Agents Code
#15FQF
4553
Score· 2019-11-05
Fully Parameterized Quantile Function for Distributional Reinforcement Learning Code
#16A3C FF hs
4474.5
Score· 2016-02-04
Asynchronous Methods for Deep Reinforcement Learning Code
#17QR-DQN-1
4226
Score· 2017-10-27
Distributional Reinforcement Learning with Quantile Regression Code
#18Reactor 500M
3726.1
Score· 2017-04-15
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
#19A3C FF (1 day) hs
3009.4
Score· 2016-02-04
Asynchronous Methods for Deep Reinforcement Learning Code
#20IQN
2898
Score· 2018-06-14
Implicit Quantile Networks for Distributional Reinforcement Learning Code
#21DDQN+Pop-Art noop
2869.3
Score· 2016-02-24
Learning values across many orders of magnitude
#22Duel noop
2837.7
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#23Prior noop
2654.3
Score· 2015-11-18
Prioritized Experience Replay Code
#24POP3D
2488.1
Score· 2018-07-02
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization Code
#25A2C + SIL
2259.4
Score· 2018-06-14
Self-Imitation Learning Code
#26Duel hs
2035.4
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#27ASL DDQN
1984.5
Score· 2023-05-07
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity Code
#28Advantage Learning
1924.42
Score· 2015-12-15
Increasing the Action Gap: New Operators for Reinforcement Learning Code
#29Prior hs
1745.1
Score· 2015-11-18
Prioritized Experience Replay Code
#30Persistent AL
1673.52
Score· 2015-12-15
Increasing the Action Gap: New Operators for Reinforcement Learning Code
#31Nature DQN
1629
Score
No paperCode
#32ES FF (1 hour) noop
1562
Score· 2017-03-10
Evolution Strategies as a Scalable Alternative to Reinforcement Learning Code
#33C51 noop
1516
Score· 2017-07-21
A Distributional Perspective on Reinforcement Learning Code
#34DQN hs
1458.7
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#35DQN noop
1364.5
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#36DDQN (tuned) hs
1193.2
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#37Prior+Duel noop
1192.7
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#38Bootstrapped DQN
1032
Score· 2016-02-15
Deep Exploration via Bootstrapped DQN Code
#39Prior+Duel hs
1021.9
Score· 2015-09-22
Deep Reinforcement Learning with Double Q-learning Code
#40Gorila
933.6
Score· 2015-07-15
Massively Parallel Methods for Deep Reinforcement Learning Code
#41Best Learner
907.3
Score· 2012-07-19
The Arcade Learning Environment: An Evaluation Platform for General Agents Code
#42DDQN (tuned) noop
734.7
Score· 2015-11-20
Dueling Network Architectures for Deep Reinforcement Learning Code
#43SARSA
89
Score
No paper