Multi-agent Reinforcement Learning on UAV Logistics

Metric: Average Reward (higher is better)
