Code Generation on APPS

Metric: Interview Pass@any (higher is better)

LeaderboardDataset
#ModelInterview Pass@anyExtra DataPaperDateCode

No results available for this benchmark.