TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/LLaVA-1.5-13B

LLaVA-1.5-13B

Reported on 8 benchmarks across 5 tasks · 3 papers · 1 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Other3 results

  • Factual Inconsistency Detection in Chart CaptioningonCHOCOLATE-FT
    Kendall's Tau-c· 2023-10-05
    0.214
    best: 0.291 (Bard (before Gemini))
    SOTA
    Improved Baselines with Visual Instruction TuningarXiv:2310.03744
  • Factual Inconsistency Detection in Chart CaptioningonCHOCOLATE-LVLM
    Kendall's Tau-c· 2023-10-05
    0.002
    best: 0.178 (ChartVE)
    Improved Baselines with Visual Instruction TuningarXiv:2310.03744
  • Factual Inconsistency Detection in Chart CaptioningonCHOCOLATE-LLM
    Kendall's Tau-c· 2023-10-05
    0.057
    best: 0.205 (GPT-4V)
    Improved Baselines with Visual Instruction TuningarXiv:2310.03744

Natural Language Processing3 results

  • Visual Question Answering (VQA)onIllusionVQA
    Accuracy· 2024-03-23
    40
    best: 62.99 (GPT4-Vision 4-shot)
    IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language ModelsarXiv:2403.15952
  • Visual Question Answering (VQA)onBenchLMM
    GPT-3.5 score· 2023-10-05
    55.53
    best: 58.37 (GPT-4V)
    Improved Baselines with Visual Instruction TuningarXiv:2310.03744
  • Visual Question AnsweringonBenchLMM
    GPT-3.5 score· 2023-10-05
    55.53
    best: 58.37 (GPT-4V)
    Improved Baselines with Visual Instruction TuningarXiv:2310.03744

Computer Vision2 results

  • Object LocalizationonIllusionVQA
    Accuracy· 2024-03-23
    24.8
    best: 49.7 (GPT4-Vision 4-shot+CoT)
    IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language ModelsarXiv:2403.15952
  • MMR totalonMRR-Benchmark
    Total Column Score· uses extra data· 2023-04-17
    243
    best: 463 (Claude 3.5 Sonnet)
    Visual Instruction TuningarXiv:2304.08485