GPT-4-turbo
Reported on 1 benchmark across 1 task · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing · 1 result
- wAcc · 2024-12-19 · 34.23 · best: 35.92 (Claude-3.5)