TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Playing Games/Atari Games/Atari 2600 Seaquest

Atari Games on Atari 2600 Seaquest

Metric: Score (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Score▼Extra DataPaperDate↕Code
1GDI-H3(200M frames)1000000NoGeneralized Data Distribution Iteration2022-06-07-
2GDI-H31000000NoGeneralized Data Distribution Iteration2022-06-07-
3Agent57999997.63NoAgent57: Outperforming the Atari Human Benchmark2020-03-30Code
4R2D2999996.7No--Code
5MuZero999976.52NoMastering Atari, Go, Chess and Shogi by Planning...2019-11-19Code
6MuZero (Res2 Adam)999659.18NoOnline and Offline Reinforcement Learning by Pla...2021-04-13Code
7GDI-I3943910NoGDI: Rethinking What Makes Reinforcement Learnin...2021-06-11-
8GDI-I3943910NoGDI: Rethinking What Makes Reinforcement Learnin...2021-06-11-
9Ape-X392952.3NoDistributed Prioritized Experience Replay2018-03-02Code
10C51 noop266434NoA Distributional Perspective on Reinforcement Le...2017-07-21Code
11Duel noop50254.2NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
12Duel hs37361.6NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
13IQN30140NoImplicit Quantile Networks for Distributional Re...2018-06-14Code
14ASL DDQN29278.6NoTrain a Real-world Local Path Planner in One Hou...2023-05-07Code
15Prior noop26357.8NoPrioritized Experience Replay2015-11-18Code
16Prior hs25463.7NoPrioritized Experience Replay2015-11-18Code
17NoisyNet-Dueling16754NoNoisy Networks for Exploration2017-06-30Code
18DDQN (tuned) noop16452.7NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
19DDQN (tuned) hs14498NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
20Persistent AL13230.74NoIncreasing the Action Gap: New Operators for Rei...2015-12-15Code
21DDQN+Pop-Art noop10932.3NoLearning values across many orders of magnitude2016-02-24-
22Gorila10145.9NoMassively Parallel Methods for Deep Reinforcemen...2015-07-15Code
23Bootstrapped DQN9083.1NoDeep Exploration via Bootstrapped DQN2016-02-15Code
24Advantage Learning8670.5NoIncreasing the Action Gap: New Operators for Rei...2015-12-15Code
25QR-DQN-18268NoDistributional Reinforcement Learning with Quant...2017-10-27Code
26DreamerV27480NoMastering Atari with Discrete World Models2020-10-05Code
27Recurrent Rational DQN Average7460NoAdaptive Rational Activations to Boost Deep Rein...2021-02-18Code
28DARQN soft7263NoDeep Attention Recurrent Q-Network2015-12-05Code
29Rational DQN Average6603NoAdaptive Rational Activations to Boost Deep Rein...2021-02-18Code
30DQN noop5860.6NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
31VPN5628NoValue Prediction Network2017-07-11Code
32Nature DQN5286No--Code
33UCT5132.4NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
34DQN hs4216.7NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
35DNA4146NoDNA: Proximal Policy Optimization with a Dual Ne...2022-06-20Code
36A2C + SIL2456.5NoSelf-Imitation Learning2018-06-14Code
37A3C FF hs2355.4NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
38A3C FF (1 day) hs2300.2NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
39DDRL A3C1832NoDistributed Deep Reinforcement Learning: Learn h...2018-01-09Code
40POP3D1807.47NoPolicy Optimization With Penalized Point Probabi...2018-07-02Code
41IMPALA (deep)1753.2NoIMPALA: Scalable Distributed Deep-RL with Import...2018-02-05Code
42DQN Best1740NoPlaying Atari with Deep Reinforcement Learning2013-12-19Code
43MAC1703.4NoMean Actor Critic2017-09-01Code
44Prior+Duel hs1431.2NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
45ES FF (1 hour) noop1390NoEvolution Strategies as a Scalable Alternative t...2017-03-10Code
46A3C LSTM hs1326.1NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
47Prior+Duel noop931.6NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
48CGP724NoEvolving simple programs for playing Atari games2018-06-14Code
49SARSA675.5No---
50Best Learner664.8NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
51Discrete Latent Space World Model (VQ-VAE)635NoSmaller World Models for Reinforcement Learning2020-10-12-
52Rainbow+SEER561.2NoImproving Computational Efficiency in Visual Rei...2021-03-04Code
53CURL408NoCURL: Contrastive Unsupervised Representations f...2020-04-08Code
54IDVQ + DRSC + XNES320NoPlaying Atari with Six Neurons2018-06-04Code
55SAC211.6NoSoft Actor-Critic for Discrete Action Settings2019-10-16Code
56DT2.4NoDecision Transformer: Reinforcement Learning via...2021-06-02Code