gpt-4o-2024-08-06
Reported on 6 benchmarks across 3 tasks · 3 papers · 1 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing · 4 results
- pass@1 · 2024-09-08 · SOTA · 0.885 · best: 0.952 (o1-preview)
- pass@1 · 2024-09-19 · 0.531 · best: 0.679 (claude-3-5-sonnet)
- F1 (%) · 2024-09-12 · 92.2 · best: 95.78 (gpt4-0613_zeroshot)
- F1 (%) · 2024-09-12 · 63.45 · best: 85.21 (gpt4-0613_fewshot-10)
Knowledge Base · 2 results
- F1 (%) · 2024-09-12 · 92.2 · best: 95.78 (gpt4-0613_zeroshot)
- F1 (%) · 2024-09-12 · 63.45 · best: 85.21 (gpt4-0613_fewshot-10)