Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Computer Vision
/
Image Classification
/
ColonINST-v1 (Unseen)
Image Classification on ColonINST-v1 (Unseen)
Metric: Accuray (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Export CSV
Sort:
Accuray (best first)
Accuray (worst first)
Date (newest first)
Date (oldest first)
Model name (A→Z)
#
Model
↕
Accuray
▼
Extra Data
Paper
Date
↕
Code
1
ColonGPT (w/ LoRA, w/o extra data)
83.24
No
Frontiers in Intelligent Colonoscopy
2024-10-22
Code
2
LLaVA-v1.5 (w/ LoRA, w/ extra data)
80.89
No
Improved Baselines with Visual Instruction Tuning
2023-10-05
Code
3
MobileVLM-1.7B (w/ LoRA, w/ extra data)
80.44
No
MobileVLM : A Fast, Strong and Open Vision Langu...
2023-12-28
Code
4
Bunny-v1.0-3B (w/ LoRA, w/ extra data)
79.5
No
Efficient Multimodal Learning from Data-centric ...
2024-02-18
Code
5
LLaVA-Med-v1.5 (w/ LoRA, w/o extra data)
79.24
No
LLaVA-Med: Training a Large Language-and-Vision ...
2023-06-01
Code
6
LLaVA-v1.5 (w/ LoRA, w/o extra data)
79.1
No
Improved Baselines with Visual Instruction Tuning
2023-10-05
Code
7
MGM-2B (w/o LoRA, w/o extra data)
78.99
No
Mini-Gemini: Mining the Potential of Multi-modal...
2024-03-27
Code
8
MobileVLM-1.7B (w/o LoRA, w/ extra data)
78.75
No
MobileVLM : A Fast, Strong and Open Vision Langu...
2023-12-28
Code
9
MGM-2B (w/o LoRA, w/ extra data)
78.69
No
Mini-Gemini: Mining the Potential of Multi-modal...
2024-03-27
Code
10
LLaVA-Med-v1.0 (w/o LoRA, w/o extra data)
78.04
No
LLaVA-Med: Training a Large Language-and-Vision ...
2023-06-01
Code
11
MiniGPT-v2 (w/ LoRA, w/o extra data)
77.93
No
MiniGPT-v2: large language model as a unified in...
2023-10-14
Code
12
LLaVA-Med-v1.0 (w/o LoRA, w/ extra data)
77.38
No
LLaVA-Med: Training a Large Language-and-Vision ...
2023-06-01
Code
13
MiniGPT-v2 (w/ LoRA, w/ extra data)
76.82
No
MiniGPT-v2: large language model as a unified in...
2023-10-14
Code
14
Bunny-v1.0-3B (w/ LoRA, w/o extra data)
75.5
No
Efficient Multimodal Learning from Data-centric ...
2024-02-18
Code
15
LLaVA-v1 (w/ LoRA, w/o extra data)
72.08
No
Visual Instruction Tuning
2023-04-17
Code
16
LLaVA-Med-v1.5 (w/ LoRA, w/ extra data)
66.51
No
LLaVA-Med: Training a Large Language-and-Vision ...
2023-06-01
Code
17
LLaVA-v1 (w/ LoRA, w/ extra data)
42.17
No
Visual Instruction Tuning
2023-04-17
Code
#1
ColonGPT (w/ LoRA, w/o extra data)
SOTA
83.24
Accuray
· 2024-10-22
Frontiers in Intelligent Colonoscopy
Code
#2
LLaVA-v1.5 (w/ LoRA, w/ extra data)
SOTA
80.89
Accuray
· 2023-10-05
Improved Baselines with Visual Instruction Tuning
Code
#3
MobileVLM-1.7B (w/ LoRA, w/ extra data)
80.44
Accuray
· 2023-12-28
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices
Code
#4
Bunny-v1.0-3B (w/ LoRA, w/ extra data)
79.5
Accuray
· 2024-02-18
Efficient Multimodal Learning from Data-centric Perspective
Code
#5
LLaVA-Med-v1.5 (w/ LoRA, w/o extra data)
SOTA
79.24
Accuray
· 2023-06-01
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
Code
#6
LLaVA-v1.5 (w/ LoRA, w/o extra data)
79.1
Accuray
· 2023-10-05
Improved Baselines with Visual Instruction Tuning
Code
#7
MGM-2B (w/o LoRA, w/o extra data)
78.99
Accuray
· 2024-03-27
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Code
#8
MobileVLM-1.7B (w/o LoRA, w/ extra data)
78.75
Accuray
· 2023-12-28
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices
Code
#9
MGM-2B (w/o LoRA, w/ extra data)
78.69
Accuray
· 2024-03-27
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Code
#10
LLaVA-Med-v1.0 (w/o LoRA, w/o extra data)
78.04
Accuray
· 2023-06-01
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
Code
#11
MiniGPT-v2 (w/ LoRA, w/o extra data)
77.93
Accuray
· 2023-10-14
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
Code
#12
LLaVA-Med-v1.0 (w/o LoRA, w/ extra data)
77.38
Accuray
· 2023-06-01
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
Code
#13
MiniGPT-v2 (w/ LoRA, w/ extra data)
76.82
Accuray
· 2023-10-14
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
Code
#14
Bunny-v1.0-3B (w/ LoRA, w/o extra data)
75.5
Accuray
· 2024-02-18
Efficient Multimodal Learning from Data-centric Perspective
Code
#15
LLaVA-v1 (w/ LoRA, w/o extra data)
SOTA
72.08
Accuray
· 2023-04-17
Visual Instruction Tuning
Code
#16
LLaVA-Med-v1.5 (w/ LoRA, w/ extra data)
66.51
Accuray
· 2023-06-01
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
Code
#17
LLaVA-v1 (w/ LoRA, w/ extra data)
42.17
Accuray
· 2023-04-17
Visual Instruction Tuning
Code