Metric: Accuray (higher is better)
| # | Model↕ | Accuray▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | ColonGPT (w/ LoRA, w/o extra data) | 83.24 | No | Frontiers in Intelligent Colonoscopy | 2024-10-22 | Code |
| 2 | LLaVA-v1.5 (w/ LoRA, w/ extra data) | 80.89 | No | Improved Baselines with Visual Instruction Tuning | 2023-10-05 | Code |
| 3 | MobileVLM-1.7B (w/ LoRA, w/ extra data) | 80.44 | No | MobileVLM : A Fast, Strong and Open Vision Langu... | 2023-12-28 | Code |
| 4 | Bunny-v1.0-3B (w/ LoRA, w/ extra data) | 79.5 | No | Efficient Multimodal Learning from Data-centric ... | 2024-02-18 | Code |
| 5 | LLaVA-Med-v1.5 (w/ LoRA, w/o extra data) | 79.24 | No | LLaVA-Med: Training a Large Language-and-Vision ... | 2023-06-01 | Code |
| 6 | LLaVA-v1.5 (w/ LoRA, w/o extra data) | 79.1 | No | Improved Baselines with Visual Instruction Tuning | 2023-10-05 | Code |
| 7 | MGM-2B (w/o LoRA, w/o extra data) | 78.99 | No | Mini-Gemini: Mining the Potential of Multi-modal... | 2024-03-27 | Code |
| 8 | MobileVLM-1.7B (w/o LoRA, w/ extra data) | 78.75 | No | MobileVLM : A Fast, Strong and Open Vision Langu... | 2023-12-28 | Code |
| 9 | MGM-2B (w/o LoRA, w/ extra data) | 78.69 | No | Mini-Gemini: Mining the Potential of Multi-modal... | 2024-03-27 | Code |
| 10 | LLaVA-Med-v1.0 (w/o LoRA, w/o extra data) | 78.04 | No | LLaVA-Med: Training a Large Language-and-Vision ... | 2023-06-01 | Code |
| 11 | MiniGPT-v2 (w/ LoRA, w/o extra data) | 77.93 | No | MiniGPT-v2: large language model as a unified in... | 2023-10-14 | Code |
| 12 | LLaVA-Med-v1.0 (w/o LoRA, w/ extra data) | 77.38 | No | LLaVA-Med: Training a Large Language-and-Vision ... | 2023-06-01 | Code |
| 13 | MiniGPT-v2 (w/ LoRA, w/ extra data) | 76.82 | No | MiniGPT-v2: large language model as a unified in... | 2023-10-14 | Code |
| 14 | Bunny-v1.0-3B (w/ LoRA, w/o extra data) | 75.5 | No | Efficient Multimodal Learning from Data-centric ... | 2024-02-18 | Code |
| 15 | LLaVA-v1 (w/ LoRA, w/o extra data) | 72.08 | No | Visual Instruction Tuning | 2023-04-17 | Code |
| 16 | LLaVA-Med-v1.5 (w/ LoRA, w/ extra data) | 66.51 | No | LLaVA-Med: Training a Large Language-and-Vision ... | 2023-06-01 | Code |
| 17 | LLaVA-v1 (w/ LoRA, w/ extra data) | 42.17 | No | Visual Instruction Tuning | 2023-04-17 | Code |