Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Computer Vision
/
Image Classification
/
ColonINST-v1 (Seen)
Image Classification on ColonINST-v1 (Seen)
Metric: Accuray (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Export CSV
Sort:
Accuray (best first)
Accuray (worst first)
Date (newest first)
Date (oldest first)
Model name (A→Z)
#
Model
↕
Accuray
▼
Extra Data
Paper
Date
↕
Code
1
ColonGPT (w/ LoRA, w/o extra data)
94.06
No
Frontiers in Intelligent Colonoscopy
2024-10-22
Code
2
LLaVA-Med-v1.0 (w/o LoRA, w/ extra data)
93.84
No
LLaVA-Med: Training a Large Language-and-Vision ...
2023-06-01
Code
3
MobileVLM-1.7B (w/ LoRA, w/ extra data)
93.64
No
MobileVLM : A Fast, Strong and Open Vision Langu...
2023-12-28
Code
4
LLaVA-Med-v1.5 (w/ LoRA, w/o extra data)
93.62
No
LLaVA-Med: Training a Large Language-and-Vision ...
2023-06-01
Code
5
LLaVA-Med-v1.0 (w/o LoRA, w/o extra data)
93.52
No
LLaVA-Med: Training a Large Language-and-Vision ...
2023-06-01
Code
6
LLaVA-v1.5 (w/ LoRA, w/ extra data)
93.33
No
Improved Baselines with Visual Instruction Tuning
2023-10-05
Code
7
MGM-2B (w/o LoRA, w/ extra data)
93.24
No
Mini-Gemini: Mining the Potential of Multi-modal...
2024-03-27
Code
8
MobileVLM-1.7B (w/o LoRA, w/ extra data)
93.02
No
MobileVLM : A Fast, Strong and Open Vision Langu...
2023-12-28
Code
9
LLaVA-v1.5 (w/ LoRA, w/o extra data)
92.97
No
Improved Baselines with Visual Instruction Tuning
2023-10-05
Code
10
MGM-2B (w/o LoRA, w/o extra data)
92.97
No
Mini-Gemini: Mining the Potential of Multi-modal...
2024-03-27
Code
11
Bunny-v1.0-3B (w/ LoRA, w/ extra data)
92.47
No
Efficient Multimodal Learning from Data-centric ...
2024-02-18
Code
12
MiniGPT-v2 (w/ LoRA, w/o extra data)
91.49
No
MiniGPT-v2: large language model as a unified in...
2023-10-14
Code
13
Bunny-v1.0-3B (w/ LoRA, w/o extra data)
91.16
No
Efficient Multimodal Learning from Data-centric ...
2024-02-18
Code
14
MiniGPT-v2 (w/ LoRA, w/ extra data)
90
No
MiniGPT-v2: large language model as a unified in...
2023-10-14
Code
15
LLaVA-v1 (w/ LoRA, w/ extra data)
89.61
No
Visual Instruction Tuning
2023-04-17
Code
16
LLaVA-v1 (w/ LoRA, w/o extra data)
87.86
No
Visual Instruction Tuning
2023-04-17
Code
17
LLaVA-Med-v1.5 (w/ LoRA, w/ extra data)
87.22
No
LLaVA-Med: Training a Large Language-and-Vision ...
2023-06-01
Code
#1
ColonGPT (w/ LoRA, w/o extra data)
SOTA
94.06
Accuray
· 2024-10-22
Frontiers in Intelligent Colonoscopy
Code
#2
LLaVA-Med-v1.0 (w/o LoRA, w/ extra data)
SOTA
93.84
Accuray
· 2023-06-01
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
Code
#3
MobileVLM-1.7B (w/ LoRA, w/ extra data)
93.64
Accuray
· 2023-12-28
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices
Code
#4
LLaVA-Med-v1.5 (w/ LoRA, w/o extra data)
93.62
Accuray
· 2023-06-01
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
Code
#5
LLaVA-Med-v1.0 (w/o LoRA, w/o extra data)
93.52
Accuray
· 2023-06-01
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
Code
#6
LLaVA-v1.5 (w/ LoRA, w/ extra data)
93.33
Accuray
· 2023-10-05
Improved Baselines with Visual Instruction Tuning
Code
#7
MGM-2B (w/o LoRA, w/ extra data)
93.24
Accuray
· 2024-03-27
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Code
#8
MobileVLM-1.7B (w/o LoRA, w/ extra data)
93.02
Accuray
· 2023-12-28
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices
Code
#9
LLaVA-v1.5 (w/ LoRA, w/o extra data)
92.97
Accuray
· 2023-10-05
Improved Baselines with Visual Instruction Tuning
Code
#10
MGM-2B (w/o LoRA, w/o extra data)
92.97
Accuray
· 2024-03-27
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Code
#11
Bunny-v1.0-3B (w/ LoRA, w/ extra data)
92.47
Accuray
· 2024-02-18
Efficient Multimodal Learning from Data-centric Perspective
Code
#12
MiniGPT-v2 (w/ LoRA, w/o extra data)
91.49
Accuray
· 2023-10-14
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
Code
#13
Bunny-v1.0-3B (w/ LoRA, w/o extra data)
91.16
Accuray
· 2024-02-18
Efficient Multimodal Learning from Data-centric Perspective
Code
#14
MiniGPT-v2 (w/ LoRA, w/ extra data)
90
Accuray
· 2023-10-14
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
Code
#15
LLaVA-v1 (w/ LoRA, w/ extra data)
SOTA
89.61
Accuray
· 2023-04-17
Visual Instruction Tuning
Code
#16
LLaVA-v1 (w/ LoRA, w/o extra data)
87.86
Accuray
· 2023-04-17
Visual Instruction Tuning
Code
#17
LLaVA-Med-v1.5 (w/ LoRA, w/ extra data)
87.22
Accuray
· 2023-06-01
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
Code