Multi-agent Reinforcement Learning on Off_Superhard_sequential

Metric: Median Win Rate (higher is better)
