3D on cartpole.balance

Metric: Return (higher is better)

LeaderboardDataset
Loading chart...
#ModelReturnAugmentationsPaperDateCode
1SMuZero984.86NoLearning and Planning in Complex Action Spaces2021-04-13Code