Code Generation on APPS

Metric: Introductory Pass@1000 (higher is better)

LeaderboardDataset
#ModelIntroductory Pass@1000Extra DataPaperDateCode

No results available for this benchmark.