Metric: Detection (higher is better)
| # | Model↕ | Detection▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | MEAgent | 29.09 | No | - | - | Code |
| 2 | GPT-4-1106-Vision-Preview | 7 | No | GPT-4 Technical Report | 2023-03-15 | Code |
| 3 | Gemini-1.5 Pro | 1.4 | No | Gemini 1.5: Unlocking multimodal understanding a... | 2024-03-08 | Code |
| 4 | Qwen-VL-Max | 1.05 | No | Qwen-VL: A Versatile Vision-Language Model for U... | 2023-08-24 | Code |
| 5 | GLM-4V | 0.89 | No | CogVLM: Visual Expert for Pretrained Language Mo... | 2023-11-06 | Code |
| 6 | VCIN | 0.28 | No | - | - | Code |
| 7 | REX | 0 | No | - | - | Code |