Metric: ROUGE-L (higher is better)
| # | Model↕ | ROUGE-L▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | MEAgent | 79.41 | No | - | - | Code |
| 2 | Gemini-1.5 Pro | 55.9 | No | Gemini 1.5: Unlocking multimodal understanding a... | 2024-03-08 | Code |
| 3 | GPT-4-1106-Vision-Preview | 52.67 | No | GPT-4 Technical Report | 2023-03-15 | Code |
| 4 | Qwen-VL-Max | 34.52 | No | Qwen-VL: A Versatile Vision-Language Model for U... | 2023-08-24 | Code |
| 5 | VCIN | 33.34 | No | - | - | Code |
| 6 | GLM-4V | 24.28 | No | CogVLM: Visual Expert for Pretrained Language Mo... | 2023-11-06 | Code |
| 7 | REX | 23.23 | No | REX: Reasoning-aware and Grounded Explanation | 2022-03-11 | Code |