Continuous Control on reacher.hard

Metric: Return (higher is better)

LeaderboardDataset
Loading chart...
#ModelReturnExtra DataPaperDateCode
1SMuZero971.53NoLearning and Planning in Complex Action Spaces2021-04-13Code