Multi-agent Reinforcement Learning on ParticleEnvs Cooperative Communication

Metric: final agent reward (higher is better)

LeaderboardDataset
Loading chart...