TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Playing Games/Atari Games/Atari 2600 Boxing

Atari Games on Atari 2600 Boxing

Metric: Score (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Score▼Extra DataPaperDate↕Code
1MuZero100NoMastering Atari, Go, Chess and Shogi by Planning...2019-11-19Code
2Ape-X100NoDistributed Prioritized Experience Replay2018-03-02Code
3NoisyNet-Dueling100NoNoisy Networks for Exploration2017-06-30Code
4UCT100NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
5Agent57100NoAgent57: Outperforming the Atari Human Benchmark2020-03-30Code
6MuZero (Res2 Adam)100NoOnline and Offline Reinforcement Learning by Pla...2021-04-13Code
7GDI-H3100NoGeneralized Data Distribution Iteration2022-06-07-
8GDI-I3100NoGeneralized Data Distribution Iteration2022-06-07-
9GDI-H3100NoGeneralized Data Distribution Iteration2022-06-07-
10IMPALA (deep)99.96NoIMPALA: Scalable Distributed Deep-RL with Import...2018-02-05Code
11QR-DQN-199.9NoDistributional Reinforcement Learning with Quant...2017-10-27Code
12DNA99.9NoDNA: Proximal Policy Optimization with a Dual Ne...2022-06-20Code
13IQN99.8NoImplicit Quantile Networks for Distributional Re...2018-06-14Code
14A2C + SIL99.6NoSelf-Imitation Learning2018-06-14Code
15ASL DDQN99.6NoTrain a Real-world Local Path Planner in One Hou...2023-05-07Code
16Duel noop99.4NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
17Reactor 500M99.4NoThe Reactor: A fast and sample-efficient Actor-C...2017-04-15-
18DDQN+Pop-Art noop99.3NoLearning values across many orders of magnitude2016-02-24-
19Prior+Duel noop98.9NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
20R2D298.5No--Code
21DDRL A3C98NoDistributed Deep Reinforcement Learning: Learn h...2018-01-09Code
22C51 noop97.8NoA Distributional Perspective on Reinforcement Le...2017-07-21Code
23POP3D97.23NoPolicy Optimization With Penalized Point Probabi...2018-07-02Code
24Prior noop95.6NoPrioritized Experience Replay2015-11-18Code
25Persistent AL94.3NoIncreasing the Action Gap: New Operators for Rei...2015-12-15Code
26Advantage Learning93.94NoIncreasing the Action Gap: New Operators for Rei...2015-12-15Code
27Bootstrapped DQN93.2NoDeep Exploration via Bootstrapped DQN2016-02-15Code
28DreamerV292NoMastering Atari with Discrete World Models2020-10-05Code
29DDQN (tuned) noop91.6NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
30DQN noop88NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
31Prior+Duel hs79.2NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
32Duel hs77.3NoDueling Network Architectures for Deep Reinforce...2015-11-20Code
33Gorila74.2NoMassively Parallel Methods for Deep Reinforcemen...2015-07-15Code
34DDQN (tuned) hs73.5NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
35Prior hs72.3NoPrioritized Experience Replay2015-11-18Code
36Nature DQN71.8No--Code
37DQN hs70.3NoDeep Reinforcement Learning with Double Q-learning2015-09-22Code
38A3C FF hs59.8NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
39ES FF (1 hour) noop49.8NoEvolution Strategies as a Scalable Alternative t...2017-03-10Code
40Best Learner44NoThe Arcade Learning Environment: An Evaluation P...2012-07-19Code
41CGP38.4NoEvolving simple programs for playing Atari games2018-06-14Code
42A3C LSTM hs37.3NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
43A3C FF (1 day) hs33.7NoAsynchronous Methods for Deep Reinforcement Lear...2016-02-04Code
44SARSA9.8No---
45CURL4.8NoCURL: Contrastive Unsupervised Representations f...2020-04-08Code