TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Image Classification/ColonINST-v1 (Unseen)

Image Classification on ColonINST-v1 (Unseen)

Metric: Accuray (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Accuray▼Extra DataPaperDate↕Code
1ColonGPT (w/ LoRA, w/o extra data)83.24NoFrontiers in Intelligent Colonoscopy2024-10-22Code
2LLaVA-v1.5 (w/ LoRA, w/ extra data)80.89NoImproved Baselines with Visual Instruction Tuning2023-10-05Code
3MobileVLM-1.7B (w/ LoRA, w/ extra data)80.44NoMobileVLM : A Fast, Strong and Open Vision Langu...2023-12-28Code
4Bunny-v1.0-3B (w/ LoRA, w/ extra data)79.5NoEfficient Multimodal Learning from Data-centric ...2024-02-18Code
5LLaVA-Med-v1.5 (w/ LoRA, w/o extra data)79.24NoLLaVA-Med: Training a Large Language-and-Vision ...2023-06-01Code
6LLaVA-v1.5 (w/ LoRA, w/o extra data)79.1NoImproved Baselines with Visual Instruction Tuning2023-10-05Code
7MGM-2B (w/o LoRA, w/o extra data)78.99NoMini-Gemini: Mining the Potential of Multi-modal...2024-03-27Code
8MobileVLM-1.7B (w/o LoRA, w/ extra data)78.75NoMobileVLM : A Fast, Strong and Open Vision Langu...2023-12-28Code
9MGM-2B (w/o LoRA, w/ extra data)78.69NoMini-Gemini: Mining the Potential of Multi-modal...2024-03-27Code
10LLaVA-Med-v1.0 (w/o LoRA, w/o extra data)78.04NoLLaVA-Med: Training a Large Language-and-Vision ...2023-06-01Code
11MiniGPT-v2 (w/ LoRA, w/o extra data)77.93NoMiniGPT-v2: large language model as a unified in...2023-10-14Code
12LLaVA-Med-v1.0 (w/o LoRA, w/ extra data)77.38NoLLaVA-Med: Training a Large Language-and-Vision ...2023-06-01Code
13MiniGPT-v2 (w/ LoRA, w/ extra data)76.82NoMiniGPT-v2: large language model as a unified in...2023-10-14Code
14Bunny-v1.0-3B (w/ LoRA, w/o extra data)75.5NoEfficient Multimodal Learning from Data-centric ...2024-02-18Code
15LLaVA-v1 (w/ LoRA, w/o extra data)72.08NoVisual Instruction Tuning2023-04-17Code
16LLaVA-Med-v1.5 (w/ LoRA, w/ extra data)66.51NoLLaVA-Med: Training a Large Language-and-Vision ...2023-06-01Code
17LLaVA-v1 (w/ LoRA, w/ extra data)42.17NoVisual Instruction Tuning2023-04-17Code