TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/EXP

EXP

Reported on 24 benchmarks across 3 tasks · 1 paper · 3 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing16 results

  • Visual Question Answering (VQA)onGQA-REX
    Grounding· 2018-09-08
    33.52
    best: 77.33 (VCIN)
    SOTA
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Visual Question AnsweringonGQA-REX
    Grounding· 2018-09-08
    33.52
    best: 77.33 (VCIN)
    SOTA
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Visual Question Answering (VQA)onGQA-REX
    BLEU-4· 2018-09-08
    42.45
    best: 58.65 (VCIN)
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Visual Question Answering (VQA)onGQA-REX
    CIDEr· 2018-09-08
    357.1
    best: 519.23 (VCIN)
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Visual Question Answering (VQA)onGQA-REX
    GQA-test· 2018-09-08
    56.92
    best: 60.61 (VCIN)
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Visual Question Answering (VQA)onGQA-REX
    GQA-val· 2018-09-08
    65.17
    best: 81.8 (VCIN)
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Visual Question Answering (VQA)onGQA-REX
    METEOR· 2018-09-08
    34.46
    best: 41.57 (VCIN)
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Visual Question Answering (VQA)onGQA-REX
    ROUGE-L· 2018-09-08
    73.51
    best: 81.45 (VCIN)
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Visual Question Answering (VQA)onGQA-REX
    SPICE· 2018-09-08
    40.35
    best: 54.63 (VCIN)
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Visual Question AnsweringonGQA-REX
    BLEU-4· 2018-09-08
    42.45
    best: 58.65 (VCIN)
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Visual Question AnsweringonGQA-REX
    CIDEr· 2018-09-08
    357.1
    best: 519.23 (VCIN)
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Visual Question AnsweringonGQA-REX
    GQA-test· 2018-09-08
    56.92
    best: 60.61 (VCIN)
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Visual Question AnsweringonGQA-REX
    GQA-val· 2018-09-08
    65.17
    best: 81.8 (VCIN)
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Visual Question AnsweringonGQA-REX
    METEOR· 2018-09-08
    34.46
    best: 41.57 (VCIN)
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Visual Question AnsweringonGQA-REX
    ROUGE-L· 2018-09-08
    73.51
    best: 81.45 (VCIN)
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Visual Question AnsweringonGQA-REX
    SPICE· 2018-09-08
    40.35
    best: 54.63 (VCIN)
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805

Computer Vision8 results

  • Explanatory Visual Question AnsweringonGQA-REX
    Grounding· 2018-09-08
    33.52
    best: 77.33 (VCIN)
    SOTA
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Explanatory Visual Question AnsweringonGQA-REX
    BLEU-4· 2018-09-08
    42.45
    best: 58.65 (VCIN)
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Explanatory Visual Question AnsweringonGQA-REX
    CIDEr· 2018-09-08
    357.1
    best: 519.23 (VCIN)
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Explanatory Visual Question AnsweringonGQA-REX
    GQA-test· 2018-09-08
    56.92
    best: 60.61 (VCIN)
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Explanatory Visual Question AnsweringonGQA-REX
    GQA-val· 2018-09-08
    65.17
    best: 81.8 (VCIN)
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Explanatory Visual Question AnsweringonGQA-REX
    METEOR· 2018-09-08
    34.46
    best: 41.57 (VCIN)
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Explanatory Visual Question AnsweringonGQA-REX
    ROUGE-L· 2018-09-08
    73.51
    best: 81.45 (VCIN)
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805
  • Explanatory Visual Question AnsweringonGQA-REX
    SPICE· 2018-09-08
    40.35
    best: 54.63 (VCIN)
    Faithful Multimodal Explanation for Visual Question AnsweringarXiv:1809.02805