Code Generation on APPS

Metric: Pass@1 (higher is better)

LeaderboardDataset
Loading chart...
#ModelPass@1Extra DataPaperDateCode
1LPW (GPT-4o)62.6NoPlanning-Driven Programming: A Large Language Mo...2024-11-21Code