GPT-4o + CA
Reported on 6 benchmarks across 3 tasks · 1 paper · 1 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Reasoning · 4 results
- Text Score · 2025-01-23 · 75.5 (SOTA)
- Group Score · 2025-01-23 · 52 · best: 58.75 (GPT-4V (CoT, pick b/w two options))
- Image Score · 2025-01-23 · 58.5 · best: 68.75 (GPT-4V (CoT, pick b/w two options))
- 2-Class Accuracy · 2025-01-23 · 92.8 · best: 93.6 (Gemini-2.0 + CA)
Computer Vision · 2 results
- Avg. Accuracy · 2025-01-23 · 77.3 · best: 91.42 (Human (Amateur))
- Avg. Accuracy · 2025-01-23 · 77.3 · best: 91.42 (Human (Amateur))