gpt-4-1106-preview
Reported on 4 benchmarks across 1 task · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Computer Code · 4 results
- API · 2024-03-07 · 62.58 · best: 75.16 (deepseek-coder-33b-base)
- Algorithmic · 2024-03-07 · 42.11 · best: 60.78 (deepseek-coder-33b-base)
- Average · 2024-03-07 · 53.28 · best: 69.01 (deepseek-coder-33b-base)
- Control · 2024-03-07 · 55.15 · best: 71.1 (deepseek-coder-33b-base)