Metric: #Learning Samples (N) (higher is better)
| # | Model | #Learning Samples (N) | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | MEAgent | 16 | No | - | - | Code |
| 2 | GPT-4-1106-Vision-Preview | 16 | No | GPT-4 Technical Report | 2023-03-15 | Code |
| 3 | Gemini-1.5 Pro | 16 | No | Gemini 1.5: Unlocking multimodal understanding a... | 2024-03-08 | Code |
| 4 | Qwen-VL-Max | 16 | No | Qwen-VL: A Versatile Vision-Language Model for U... | 2023-08-24 | Code |
| 5 | GLM-4V | 16 | No | CogVLM: Visual Expert for Pretrained Language Mo... | 2023-11-06 | Code |
| 6 | VCIN | 16 | No | - | - | Code |
| 7 | REX | 16 | No | REX: Reasoning-aware and Grounded Explanation | 2022-03-11 | Code |