Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

GLM-4V

Reported on 24 benchmarks across 3 tasks · 1 paper

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing · 16 results

  • Visual Question Answering (VQA) on SME
    #Learning Samples (N) · 2023-11-06
    16
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Visual Question Answering (VQA) on SME
    ACC · 2023-11-06
    34.23
    best: 51.45 (MEAgent)
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Visual Question Answering (VQA) on SME
    BLEU-4 · 2023-11-06
    14.45
    best: 67.91 (MEAgent)
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Visual Question Answering (VQA) on SME
    CIDEr · 2023-11-06
    127.37
    best: 510.44 (MEAgent)
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Visual Question Answering (VQA) on SME
    Detection · 2023-11-06
    0.89
    best: 29.09 (MEAgent)
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Visual Question Answering (VQA) on SME
    METEOR · 2023-11-06
    17.53
    best: 50.55 (MEAgent)
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Visual Question Answering (VQA) on SME
    ROUGE-L · 2023-11-06
    24.28
    best: 79.41 (MEAgent)
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Visual Question Answering (VQA) on SME
    SPICE · 2023-11-06
    17.7
    best: 64.09 (MEAgent)
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Visual Question Answering on SME
    #Learning Samples (N) · 2023-11-06
    16
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Visual Question Answering on SME
    ACC · 2023-11-06
    34.23
    best: 51.45 (MEAgent)
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Visual Question Answering on SME
    BLEU-4 · 2023-11-06
    14.45
    best: 67.91 (MEAgent)
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Visual Question Answering on SME
    CIDEr · 2023-11-06
    127.37
    best: 510.44 (MEAgent)
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Visual Question Answering on SME
    Detection · 2023-11-06
    0.89
    best: 29.09 (MEAgent)
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Visual Question Answering on SME
    METEOR · 2023-11-06
    17.53
    best: 50.55 (MEAgent)
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Visual Question Answering on SME
    ROUGE-L · 2023-11-06
    24.28
    best: 79.41 (MEAgent)
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Visual Question Answering on SME
    SPICE · 2023-11-06
    17.7
    best: 64.09 (MEAgent)
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079

Computer Vision · 8 results

  • Explanatory Visual Question Answering on SME
    #Learning Samples (N) · 2023-11-06
    16
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Explanatory Visual Question Answering on SME
    ACC · 2023-11-06
    34.23
    best: 51.45 (MEAgent)
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Explanatory Visual Question Answering on SME
    BLEU-4 · 2023-11-06
    14.45
    best: 67.91 (MEAgent)
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Explanatory Visual Question Answering on SME
    CIDEr · 2023-11-06
    127.37
    best: 510.44 (MEAgent)
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Explanatory Visual Question Answering on SME
    Detection · 2023-11-06
    0.89
    best: 29.09 (MEAgent)
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Explanatory Visual Question Answering on SME
    METEOR · 2023-11-06
    17.53
    best: 50.55 (MEAgent)
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Explanatory Visual Question Answering on SME
    ROUGE-L · 2023-11-06
    24.28
    best: 79.41 (MEAgent)
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079
  • Explanatory Visual Question Answering on SME
    SPICE · 2023-11-06
    17.7
    best: 64.09 (MEAgent)
    CogVLM: Visual Expert for Pretrained Language Models · arXiv:2311.03079