TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Referring expression generation/ColonINST-v1 (Unseen)

Referring expression generation on ColonINST-v1 (Unseen)

Metric: Accuray (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Accuray▼Extra DataPaperDate↕Code
1ColonGPT (w/ LoRA, w/o extra data)80.18NoFrontiers in Intelligent Colonoscopy2024-10-22Code
2MobileVLM-1.7B (w/ LoRA, w/ extra data)78.03NoMobileVLM : A Fast, Strong and Open Vision Langu...2023-12-28Code
3LLaVA-Med-v1.0 (w/o LoRA, w/ extra data)75.25NoLLaVA-Med: Training a Large Language-and-Vision ...2023-06-01Code
4Bunny-v1.0-3B (w/ LoRA, w/ extra data)75.08NoEfficient Multimodal Learning from Data-centric ...2024-02-18Code
5LLaVA-Med-v1.0 (w/o LoRA, w/o extra data)75.07NoLLaVA-Med: Training a Large Language-and-Vision ...2023-06-01Code
6MGM-2B (w/o LoRA, w/ extra data)74.3NoMini-Gemini: Mining the Potential of Multi-modal...2024-03-27Code
7MobileVLM-1.7B (w/o LoRA, w/ extra data)73.14NoMobileVLM : A Fast, Strong and Open Vision Langu...2023-12-28Code
8LLaVA-Med-v1.5 (w/ LoRA, w/o extra data)73.05NoLLaVA-Med: Training a Large Language-and-Vision ...2023-06-01Code
9LLaVA-v1.5 (w/ LoRA, w/ extra data)72.88NoImproved Baselines with Visual Instruction Tuning2023-10-05Code
10MiniGPT-v2 (w/ LoRA, w/o extra data)72.05NoMiniGPT-v2: large language model as a unified in...2023-10-14Code
11LLaVA-v1.5 (w/ LoRA, w/o extra data)70.38NoImproved Baselines with Visual Instruction Tuning2023-10-05Code
12MiniGPT-v2 (w/ LoRA, w/ extra data)70.23NoMiniGPT-v2: large language model as a unified in...2023-10-14Code
13LLaVA-Med-v1.5 (w/ LoRA, w/ extra data)70NoLLaVA-Med: Training a Large Language-and-Vision ...2023-06-01Code
14MGM-2B (w/o LoRA, w/o extra data)69.81NoMini-Gemini: Mining the Potential of Multi-modal...2024-03-27Code
15Bunny-v1.0-3B (w/ LoRA, w/o extra data)69.45NoEfficient Multimodal Learning from Data-centric ...2024-02-18Code
16LLaVA-v1 (w/ LoRA, w/o extra data)68.11NoVisual Instruction Tuning2023-04-17Code
17LLaVA-v1 (w/ LoRA, w/ extra data)46.85NoVisual Instruction Tuning2023-04-17Code