GPT-4o (CoT)
Reported on 6 benchmarks across 2 tasks
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing6 results
- Group Score35
- Text Score59.2
- Video Score51
- Group Score35
- Text Score59.2
- Video Score51