Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Video Games on Atari 2600 Gopher

Metric: Score (higher is better)


Results

| # | Model | Score | Extra Data | Paper | Date | Code |
|---|-------|-------|------------|-------|------|------|
| 1 | GDI-I3 | 488830 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 2 | GDI-H3 | 473560 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 3 | MuZero | 130345.58 | No | Mastering Atari, Go, Chess and Shogi by Planning... | 2019-11-19 | Code |
| 4 | R2D2 | 124776.3 | No | - | - | Code |
| 5 | MuZero (Res2 Adam) | 122882.5 | No | Online and Offline Reinforcement Learning by Pla... | 2021-04-13 | Code |
| 6 | Ape-X | 120500.9 | No | Distributed Prioritized Experience Replay | 2018-03-02 | Code |
| 7 | IQN | 118365 | No | Implicit Quantile Networks for Distributional Re... | 2018-06-14 | Code |
| 8 | Agent57 | 117777.08 | No | Agent57: Outperforming the Atari Human Benchmark | 2020-03-30 | Code |
| 9 | QR-DQN-1 | 113585 | No | Distributional Reinforcement Learning with Quant... | 2017-10-27 | Code |
| 10 | Prior+Duel hs | 105148.4 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 11 | Prior+Duel noop | 104368.2 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 12 | ASL DDQN | 103514.4 | No | Train a Real-world Local Path Planner in One Hou... | 2023-05-07 | Code |
| 13 | DreamerV2 | 92282 | No | Mastering Atari with Discrete World Models | 2020-10-05 | Code |
| 14 | DNA | 80104 | No | DNA: Proximal Policy Optimization with a Dual Ne... | 2022-06-20 | Code |
| 15 | IMPALA (deep) | 66782.3 | No | IMPALA: Scalable Distributed Deep-RL with Import... | 2018-02-05 | Code |
| 16 | DDQN+Pop-Art noop | 56218.2 | No | Learning values across many orders of magnitude | 2016-02-24 | - |
| 17 | NoisyNet-Dueling | 38909 | No | Noisy Networks for Exploration | 2017-06-30 | Code |
| 18 | Prior hs | 34858.8 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 19 | C51 noop | 33641 | No | A Distributional Perspective on Reinforcement Le... | 2017-07-21 | Code |
| 20 | Prior noop | 32487.2 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 21 | A2C + SIL | 23304.2 | No | Self-Imitation Learning | 2018-06-14 | Code |
| 22 | UCT | 20560 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 23 | Duel hs | 20051.4 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 24 | Bootstrapped DQN | 17438.4 | No | Deep Exploration via Bootstrapped DQN | 2016-02-15 | Code |
| 25 | A3C LSTM hs | 17106.8 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 26 | Duel noop | 15718.4 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 27 | DDQN (tuned) hs | 15253 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 28 | DDQN (tuned) noop | 14840.8 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 29 | Advantage Learning | 11912.68 | No | Increasing the Action Gap: New Operators for Rei... | 2015-12-15 | Code |
| 30 | Persistent AL | 10611.81 | No | Increasing the Action Gap: New Operators for Rei... | 2015-12-15 | Code |
| 31 | A3C FF hs | 10022.8 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 32 | DQN noop | 8777.4 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 33 | Nature DQN | 8520 | No | - | - | Code |
| 34 | A3C FF (1 day) hs | 8442.8 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 35 | DQN hs | 8190.4 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 36 | POP3D | 6207 | No | Policy Optimization With Penalized Point Probabi... | 2018-07-02 | Code |
| 37 | DARQN soft | 5356 | No | Deep Attention Recurrent Q-Network | 2015-12-05 | Code |
| 38 | Gorila | 4373 | No | Massively Parallel Methods for Deep Reinforcemen... | 2015-07-15 | Code |
| 39 | SARSA | 2368 | No | - | - | - |
| 40 | CGP | 1696 | No | Evolving simple programs for playing Atari games | 2018-06-14 | Code |
| 41 | Best Learner | 1288.3 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 42 | CURL | 801.4 | No | CURL: Contrastive Unsupervised Representations f... | 2020-04-08 | Code |
| 43 | ES FF (1 hour) noop | 582 | No | Evolution Strategies as a Scalable Alternative t... | 2017-03-10 | Code |