Metric: Average Return (higher is better)
| # | Model | Average Return | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | SAC | 5745.27 | No | Soft Actor-Critic: Off-Policy Maximum Entropy De... | 2018-01-04 | Code |
| 2 | MEow | 5526.66 | No | Maximum Entropy Reinforcement Learning via Energ... | 2024-05-22 | Code |
| 3 | DDPG | 2994.54 | No | Continuous control with deep reinforcement learn... | 2015-09-09 | Code |
| 4 | PPO | 2739.81 | No | Proximal Policy Optimization Algorithms | 2017-07-20 | Code |
| 5 | TD3 | 2612.74 | No | Addressing Function Approximation Error in Actor... | 2018-02-26 | Code |