Metric: No Context (higher is better)
| # | Model | No Context | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | MC-CoT F-Large | 94.49 | No | Boosting the Power of Small Multimodal Reasoning... | 2023-11-23 | Code |
| 2 | Honeybee | 93.17 | Yes | Honeybee: Locality-enhanced Projector for Multim... | 2023-12-11 | Code |
| 3 | Multimodal CoT | 92.89 | No | Multimodal Chain-of-Thought Reasoning in Languag... | 2023-02-02 | Code |
| 4 | Chat-UniVi-13B | 90.94 | Yes | Chat-UniVi: Unified Visual Representation Empowe... | 2023-11-14 | Code |
| 5 | UnifiedQA-BASE - CoT (QCM→ALE) | 81.81 | No | Learn to Explain: Multimodal Reasoning via Thoug... | 2022-09-20 | Code |
| 6 | GPT-3 - CoT (QCM→ALE, 2-shot) | 79.93 | No | Learn to Explain: Multimodal Reasoning via Thoug... | 2022-09-20 | Code |
| 7 | GPT-3 - CoT (QCM→AE, 2-shot) | 79.58 | No | Learn to Explain: Multimodal Reasoning via Thoug... | 2022-09-20 | Code |
| 8 | GPT-3 (QCM→A, 2-shot) | 77.42 | No | Learn to Explain: Multimodal Reasoning via Thoug... | 2022-09-20 | Code |