Metric: 10 Images, 4*4 Stitching, Exact Accuracy (higher is better)
| # | Model↕ | 10 Images, 4*4 Stitching, Exact Accuracy▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | GPT-4o | 26.9 | No | GPT-4 Technical Report | 2023-03-15 | Code |
| 2 | GPT-4V | 7.58 | No | GPT-4 Technical Report | 2023-03-15 | Code |
| 3 | Gemini Pro 1.5 | 6.09 | No | Gemini 1.5: Unlocking multimodal understanding a... | 2024-03-08 | Code |
| 4 | Gemini Pro 1.0 | 0.4 | No | Gemini: A Family of Highly Capable Multimodal Mo... | 2023-12-19 | Code |
| 5 | Claude 3 Opus | 0.4 | No | - | - | - |
| 6 | LLaVA-Llama-3 | 0 | No | - | - | Code |
| 7 | IDEFICS2-8B | 0 | No | - | - | - |
| 8 | InstructBLIP-Flan-T5-XXL | 0 | No | - | - | Code |
| 9 | CogVLM2-Llama-3 | 0 | No | - | - | Code |
| 10 | mPLUG-Owl-v2 | 0 | No | - | - | Code |
| 11 | CogVLM-17B | 0 | No | - | - | Code |
| 12 | InstructBLIP-Vicuna-13B | 0 | No | - | - | Code |