TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Playing Games/Video Games/Atari 2600 Ms. Pacman

Video Games on Atari 2600 Ms. Pacman

Metric: Score (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Score▼Extra DataPaperDate↕Code
1MuZero243401.1NoMastering Atari, Go, Chess and Shogi by Planning...2019-11-19Code
2MuZero (Res2 Adam)70659.76NoOnline and Offline Reinforcement Learning by Pla...2021-04-13Code
3Agent5763994.44NoAgent57: Outperforming the Atari Human Benchmark2020-03-30Code
4R2D242281.7No--Code
5UCT22336NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
6GDI-H311573NoGeneralized Data Distribution Iteration2022-06-07-
7GDI-I311536NoGeneralized Data Distribution Iteration2022-06-07-
8GDI-I311536NoGeneralized Data Distribution Iteration2022-06-07-
9Ape-X11255.2NoDistributed Prioritized Experience Replay2018-03-02Code
10MFEC8530.4004NoModel-Free Episodic Control with State Aggregation2020-08-21-
11FQF7631.9NoFully Parameterized Quantile Function for Distri...2019-11-05Code
12IMPALA (deep)7342.32NoIMPALA: Scalable Distributed Deep-RL with Import...2018-02-05Code
13Prior noop6518.7NoPrioritized Experience Replay2015-11-18Code
14IQN6349NoImplicit Quantile Networks for Distributional Re...2018-06-14Code
15Duel noop6283.5NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
16DNA5894NoDNA: Proximal Policy Optimization with a Dual Ne...2022-06-20Code
17QR-DQN-15821NoDistributional Reinforcement Learning with Quant...2017-10-27Code
18DreamerV25652NoMastering Atari with Discrete World Models2020-10-05Code
19NoisyNet-Dueling5546NoNoisy Networks for Exploration2017-06-30Code
20DDQN+Pop-Art noop4963.8NoLearning values across many orders of magnitude2016-02-24-
21ASL DDQN4416NoTrain a Real-world Local Path Planner in One Hou...2023-05-07Code
22Advantage Learning4065.8NoIncreasing the Action Gap: New Operators for Rei...2015-12-15Code
23A2C + SIL4025.1NoSelf-Imitation Learning2018-06-14Code
24Persistent AL3917.55NoIncreasing the Action Gap: New Operators for Rei...2015-12-15Code
25C51 noop3415NoA Distributional Perspective on Reinforcement Le...2017-07-21Code
26Prior+Duel noop3327.3NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
27DQN noop3085.6NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
28Bootstrapped DQN2983.3NoDeep Exploration via Bootstrapped DQN2016-02-15Code
29DDQN (tuned) noop2711.4NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
30VPN2689NoValue Prediction Network2017-07-11Code
31Rainbow2570.2NoRainbow: Combining Improvements in Deep Reinforc...2017-10-06Code
32CGP2568NoEvolving simple programs for playing Atari games2018-06-14Code
33Nature DQN2311No--Code
34Duel hs2250.6NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
35Prior hs1865.9NoPrioritized Experience Replay2015-11-18Code
36Best Learner1691.8NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
37POP3D1683.87NoPolicy Optimization With Penalized Point Probabi...2018-07-02Code
38CURL1492.8NoCURL: Contrastive Unsupervised Representations f...2020-04-08Code
39Gorila1263NoMassively Parallel Methods for Deep Reinforcemen...2015-07-15Code
40DDQN (tuned) hs1241.3NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
41SARSA1227No---
42DQN hs1092.3NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
43Prior+Duel hs1007.8NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
44A3C LSTM hs850.7NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
45SAC690.9NoSoft Actor-Critic for Discrete Action Settings2019-10-16Code
46A3C FF hs653.7NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
47A3C FF (1 day) hs594.4NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code