Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Video Games on Atari 2600 Private Eye

Metric: Score (higher is better)

Results

| # | Model | Score | Extra Data | Paper | Date | Code |
|---|-------|-------|------------|-------|------|------|
| 1 | Go-Explore | 95756 | No | First return, then explore | 2020-04-27 | Code |
| 2 | Agent57 | 79716.46 | No | Agent57: Outperforming the Atari Human Benchmark | 2020-03-30 | Code |
| 3 | SND-VIC | 17313 | No | Self-supervised network distillation: an effecti... | 2023-02-22 | Code |
| 4 | MuZero | 15299.98 | No | Mastering Atari, Go, Chess and Shogi by Planning... | 2019-11-19 | Code |
| 5 | GDI-I3 | 15100 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 6 | GDI-I3 | 15100 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 7 | GDI-H3 | 15100 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |
| 8 | C51 noop | 15095 | No | A Distributional Perspective on Reinforcement Le... | 2017-07-21 | Code |
| 9 | SND-STD | 15089 | No | Self-supervised network distillation: an effecti... | 2023-02-22 | Code |
| 10 | CGP | 12702.2 | No | Evolving simple programs for playing Atari games | 2018-06-14 | Code |
| 11 | RND | 8666 | No | Exploration by Random Network Distillation | 2018-10-30 | Code |
| 12 | DQN-PixelCNN | 8358.7 | No | Count-Based Exploration with Neural Density Models | 2017-03-03 | Code |
| 13 | R2D2 | 5322.7 | No | - | - | Code |
| 14 | Advantage Learning | 5276.16 | No | Increasing the Action Gap: New Operators for Rei... | 2015-12-15 | Code |
| 15 | SND-V | 4213 | No | Self-supervised network distillation: an effecti... | 2023-02-22 | Code |
| 16 | Intrinsic Reward Agent | 3036.5 | No | Large-Scale Study of Curiosity-Driven Learning | 2018-08-13 | Code |
| 17 | Gorila | 2598.6 | No | Massively Parallel Methods for Deep Reinforcemen... | 2015-07-15 | Code |
| 18 | DreamerV2 | 2198 | No | Mastering Atari with Discrete World Models | 2020-10-05 | Code |
| 19 | Best Baseline | 1947.3 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 20 | Bootstrapped DQN | 1812.5 | No | Deep Exploration via Bootstrapped DQN | 2016-02-15 | Code |
| 21 | Nature DQN | 1788 | No | - | - | Code |
| 22 | Prior+Duel hs | 1277.6 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 23 | Best Learner | 684.3 | No | The Arcade Learning Environment: An Evaluation P... | 2012-07-19 | Code |
| 24 | Prior hs | 670.7 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 25 | A2C + SIL | 661.2 | No | Self-Imitation Learning | 2018-06-14 | Code |
| 26 | A3C LSTM hs | 421.1 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 27 | QR-DQN-1 | 350 | No | Distributional Reinforcement Learning with Quant... | 2017-10-27 | Code |
| 28 | ASL DDQN | 349.7 | No | Train a Real-world Local Path Planner in One Hou... | 2023-05-07 | Code |
| 29 | Duel hs | 292.6 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 30 | DDQN+Pop-Art noop | 286.7 | No | Learning values across many orders of magnitude | 2016-02-24 | - |
| 31 | NoisyNet-Dueling | 279 | No | Noisy Networks for Exploration | 2017-06-30 | Code |
| 32 | DQN hs | 207.9 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 33 | A3C FF hs | 206.9 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 34 | Prior+Duel noop | 206 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 35 | DQN-CTS | 206 | No | Count-Based Exploration with Neural Density Models | 2017-03-03 | Code |
| 36 | Prior noop | 200 | No | Prioritized Experience Replay | 2015-11-18 | Code |
| 37 | IQN | 200 | No | Implicit Quantile Networks for Distributional Re... | 2018-06-14 | Code |
| 38 | A3C FF (1 day) hs | 194.4 | No | Asynchronous Methods for Deep Reinforcement Lear... | 2016-02-04 | Code |
| 39 | DQN noop | 146.7 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |
| 40 | DDQN (tuned) noop | 129.7 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 41 | CURL | 105.2 | No | CURL: Contrastive Unsupervised Representations f... | 2020-04-08 | Code |
| 42 | Duel noop | 103 | No | Dueling Network Architectures for Deep Reinforce... | 2015-11-20 | Code |
| 43 | ES FF (1 hour) noop | 100 | No | Evolution Strategies as a Scalable Alternative t... | 2017-03-10 | Code |
| 44 | MuZero (Res2 Adam) | 100 | No | Online and Offline Reinforcement Learning by Pla... | 2021-04-13 | Code |
| 45 | DNA | 100 | No | DNA: Proximal Policy Optimization with a Dual Ne... | 2022-06-20 | Code |
| 46 | A3C-CTS | 99.32 | No | Unifying Count-Based Exploration and Intrinsic M... | 2016-06-06 | Code |
| 47 | DQNMMCe+SR | 99.1 | No | Count-Based Exploration with the Successor Repre... | 2018-07-31 | Code |
| 48 | IMPALA (deep) | 98.5 | No | IMPALA: Scalable Distributed Deep-RL with Import... | 2018-02-05 | Code |
| 49 | SARSA | 86 | No | - | - | - |
| 50 | POP3D | 79.67 | No | Policy Optimization With Penalized Point Probabi... | 2018-07-02 | Code |
| 51 | Ape-X | 49.8 | No | Distributed Prioritized Experience Replay | 2018-03-02 | Code |
| 52 | DDQN (tuned) hs | -575.5 | No | Deep Reinforcement Learning with Double Q-learning | 2015-09-22 | Code |