MuJoCo Games on Walker2d

Metric: Mean (higher is better)

#  Model     Mean     Extra Data  Paper                                                Date        Code
1  IQ-Learn  5134     No          IQ-Learn: Inverse soft-Q Learning for Imitation      2021-06-23  Code
2  POP3D     3966.01  No          Policy Optimization With Penalized Point Probabi...  2018-07-02  Code