TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/BLIP-2 T5

BLIP-2 T5

Reported on 8 benchmarks across 1 task · 1 paper · 8 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing8 results

  • Visual Question Answering (VQA)onActivityNet
    ClipMatch@1· 2024-02-11
    53.39
    SOTA
    Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchyarXiv:2402.07270
  • Visual Question Answering (VQA)onActivityNet
    ClipMatch@5· 2024-02-11
    74.71
    SOTA
    Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchyarXiv:2402.07270
  • Visual Question Answering (VQA)onActivityNet
    Contains· 2024-02-11
    15.7
    SOTA
    Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchyarXiv:2402.07270
  • Visual Question Answering (VQA)onActivityNet
    ExactMatch· 2024-02-11
    7.07
    SOTA
    Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchyarXiv:2402.07270
  • Visual Question Answering (VQA)onActivityNet
    Follow-up ClipMatch@1· 2024-02-11
    62.02
    SOTA
    Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchyarXiv:2402.07270
  • Visual Question Answering (VQA)onActivityNet
    Follow-up ClipMatch@5· 2024-02-11
    75.13
    SOTA
    Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchyarXiv:2402.07270
  • Visual Question Answering (VQA)onActivityNet
    Follow-up Contains· 2024-02-11
    18.09
    SOTA
    Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchyarXiv:2402.07270
  • Visual Question Answering (VQA)onActivityNet
    Follow-up ExactMatch· 2024-02-11
    8.84
    SOTA
    Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchyarXiv:2402.07270