Continuous Control on cartpole.swingup

Metric: Return (higher is better)

LeaderboardDataset
Loading chart...
#ModelReturnExtra DataPaperDateCode
1SMuZero868.87NoLearning and Planning in Complex Action Spaces2021-04-13Code
2MuZero Unplugged594.3NoOnline and Offline Reinforcement Learning by Pla...2021-04-13Code