Continuous Control on cartpole.balance

Metric: Return (higher is better)

#  Model    Return  Extra Data  Paper                                           Date        Code
1  SMuZero  984.86  No          Learning and Planning in Complex Action Spaces  2021-04-13  Code