Metric: Score (higher is better)
| # | Model↕ | Score▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | Go-Explore | 102571 | Yes | Go-Explore: a New Approach for Hard-Exploration ... | 2019-01-30 | Code |
| 2 | Agent57 | 18756.01 | No | Agent57: Outperforming the Atari Human Benchmark | 2020-03-30 | Code |
| 3 | Go-Explore | 6954 | No | First return, then explore | 2020-04-27 | Code |
| 4 | IQN | 0 | No | - | - | Code |
| 5 | MuZero | 0 | No | - | - | Code |
| 6 | R2D2 | 0 | No | - | - | Code |
| 7 | CGP | 0 | No | - | - | Code |
| 8 | NoisyNet-Dueling | 0 | No | - | - | Code |
| 9 | POP3D | 0 | No | - | - | Code |
| 10 | Advantage Learning | 0 | No | - | - | Code |
| 11 | QR-DQN-1 | 0 | No | - | - | Code |
| 12 | DreamerV2 | 0 | No | - | - | Code |
| 13 | MuZero (Res2 Adam) | 0 | No | - | - | Code |
| 14 | GDI-I3 | 0 | No | - | - | - |
| 15 | GDI-I3 | 0 | No | - | - | - |
| 16 | DNA | 0 | No | - | - | Code |
| 17 | SND-V | 0 | No | - | - | Code |
| 18 | SND-VIC | 0 | No | - | - | Code |
| 19 | ASL DDQN | 0 | No | - | - | Code |
| 20 | Ape-X | -0.6 | No | Distributed Prioritized Experience Replay | 2018-03-02 | Code |
| 21 | IMPALA (deep) | -1.66 | No | IMPALA: Scalable Distributed Deep-RL with Import... | 2018-02-05 | Code |
| 22 | RND | -3 | No | Exploration by Random Network Distillation | 2018-10-30 | Code |
| 23 | GDI-H3 | -4.345 | No | Generalized Data Distribution Iteration | 2022-06-07 | - |