Metric: Accuracy (higher is better)
| # | Model↕ | Accuracy▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | GPT4V | 22.76 | Yes | Measuring Multimodal Mathematical Reasoning with... | 2024-02-22 | Code |
| 2 | Gemini Pro | 17.66 | Yes | Measuring Multimodal Mathematical Reasoning with... | 2024-02-22 | Code |
| 3 | Qwen-VL-Max | 15.59 | Yes | Measuring Multimodal Mathematical Reasoning with... | 2024-02-22 | Code |
| 4 | InternLM-XComposer2-VL | 14.54 | Yes | Measuring Multimodal Mathematical Reasoning with... | 2024-02-22 | Code |