Continuous Control on Inverted Pendulum (noisy observations)

Metric: Score (higher is better)

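| # | Model | Score | Extra Data | Paper | Date | Code |
|---|-------|-------|------------|-------|------|------|
| 1 | TRPO | 10.4 | No | Benchmarking Deep Reinforcement Learning for Con... | 2016-04-22 | Code |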