Tuomas Haarnoja, Aurick Zhou, Pieter Abbeel, Sergey Levine
A platform for Applied Reinforcement Learning (Applied RL)
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| OpenAI Gym | Humanoid-v4 | Average Return | 6211.5 | SAC |
| OpenAI Gym | HalfCheetah-v4 | Average Return | 15836.04 | SAC |
| OpenAI Gym | Ant-v4 | Average Return | 5208.09 | SAC |
| OpenAI Gym | Walker2d-v4 | Average Return | 5745.27 | SAC |
| OpenAI Gym | Hopper-v4 | Average Return | 2882.56 | SAC |