TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/LGSGM

LGSGM

Reported on 11 benchmarks across 2 tasks · 1 paper · 5 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Computer Vision7 results

  • Image RetrievalonFlickr30K 1K test
    R@10· 2021-06-04
    90.2
    best: 98.7 (X-VLM (base))
    SOTA
    A Deep Local and Global Scene-Graph Matching for Image-Text RetrievalarXiv:2106.02400
  • Image RetrievalonFlickr30K 1K test
    R@5· 2021-06-04
    84.1
    best: 97.3 (X-VLM (base))
    SOTA
    A Deep Local and Global Scene-Graph Matching for Image-Text RetrievalarXiv:2106.02400
  • Image RetrievalonFlickr30k
    Recall@10· 2021-06-04
    90.2
    best: 98.9 (BLIP-2 ViT-G (zero-shot, 1K test set))
    SOTA
    A Deep Local and Global Scene-Graph Matching for Image-Text RetrievalarXiv:2106.02400
  • Image RetrievalonFlickr30k
    Recall@5· 2021-06-04
    84.1
    best: 98.1 (BLIP-2 ViT-G (zero-shot, 1K test set))
    SOTA
    A Deep Local and Global Scene-Graph Matching for Image-Text RetrievalarXiv:2106.02400
  • Image RetrievalonFlickr30k
    Recall@Sum· 2021-06-04
    231.7
    SOTA
    A Deep Local and Global Scene-Graph Matching for Image-Text RetrievalarXiv:2106.02400
  • Image RetrievalonFlickr30K 1K test
    R@1· 2021-06-04
    57.4
    best: 86.9 (X-VLM (base))
    A Deep Local and Global Scene-Graph Matching for Image-Text RetrievalarXiv:2106.02400
  • Image RetrievalonFlickr30k
    Recall@1· 2021-06-04
    57.4
    best: 89.7 (BLIP-2 ViT-G (zero-shot, 1K test set))
    A Deep Local and Global Scene-Graph Matching for Image-Text RetrievalarXiv:2106.02400

Natural Language Processing4 results

  • Image-to-Text RetrievalonFlickr30k
    Recall@1· 2021-06-04
    71
    best: 97.9 (InternVL-G-FT (finetuned, w/o ranking))
    A Deep Local and Global Scene-Graph Matching for Image-Text RetrievalarXiv:2106.02400
  • Image-to-Text RetrievalonFlickr30k
    Recall@10· 2021-06-04
    96.1
    best: 100 (InternVL-G-FT (finetuned, w/o ranking))
    A Deep Local and Global Scene-Graph Matching for Image-Text RetrievalarXiv:2106.02400
  • Image-to-Text RetrievalonFlickr30k
    Recall@5· 2021-06-04
    91.9
    best: 100 (InternVL-G-FT (finetuned, w/o ranking))
    A Deep Local and Global Scene-Graph Matching for Image-Text RetrievalarXiv:2106.02400
  • Image-to-Text RetrievalonFlickr30k
    Recall@Sum· 2021-06-04
    259
    best: 268 (GSMN)
    A Deep Local and Global Scene-Graph Matching for Image-Text RetrievalarXiv:2106.02400