Code Generation on LiveCodeBench
Metric: Acc (higher is better)
Results

| # | Model | Acc | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | Xolver | 91.6 | No | Xolver: Multi-Agent Reasoning with Holistic Expe... | 2025-06-17 | Code |
| 2 | LPW (GPT-4o) | 59.3 | No | Planning-Driven Programming: A Large Language Mo... | 2024-11-21 | Code |
| 3 | Search-o1 | 33 | Yes | Search-o1: Agentic Search-Enhanced Large Reasoni... | 2025-01-09 | Code |