Code Generation on APPS

Metric: Introductory Pass@any (higher is better)
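
The page does not define the metric. A minimal sketch follows, assuming "Pass@any" means the fraction of problems in the introductory difficulty split for which at least one generated sample passes every test case. The function name `pass_at_any` and the input layout are hypothetical, chosen only for illustration.

```python
from typing import Dict, List


def pass_at_any(results: Dict[str, List[bool]]) -> float:
    """Fraction of problems solved by at least one sample.

    `results` maps a problem id to a list of booleans, one per generated
    sample, where True means that sample passed all of the problem's tests.
    This definition is an assumption, not taken from the benchmark page.
    """
    if not results:
        return 0.0
    solved = sum(1 for outcomes in results.values() if any(outcomes))
    return solved / len(results)


# Hypothetical outcomes for three introductory problems.
example = {
    "p1": [False, True, False],  # solved by the second sample
    "p2": [False, False],        # never solved
    "p3": [True],                # solved by the only sample
}
score = pass_at_any(example)  # 2 of 3 problems solved
```

Under this reading the score can only stay the same or rise as more samples are drawn per problem, so results are comparable only at the same sample budget.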

    No results available for this benchmark.