Metric: Average Score (higher is better)
| # | Model | Average Score | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | Stochastic MuZero | 500000 | No | - | - | Code |
| 2 | AlphaZero (With Simulator) | 500000 | No | - | - | Code |
| 3 | MuZero | 300000 | No | - | - | Code |
| 4 | Beam Search | 1024 | No | Playing 2048 With Reinforcement Learning | 2021-10-20 | Code |
| 5 | DQN (1000 episodes) | 256 | No | Playing 2048 With Reinforcement Learning | 2021-10-20 | Code |