Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/BLIP-2 T5

BLIP-2 T5

Reported on 8 benchmarks across 1 task · 1 paper · 8 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing8 results

Visual Question Answering (VQA)onActivityNet
ClipMatch@1· 2024-02-11
53.39
SOTA
Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy arXiv:2402.07270
Visual Question Answering (VQA)onActivityNet
ClipMatch@5· 2024-02-11
74.71
SOTA
Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy arXiv:2402.07270
Visual Question Answering (VQA)onActivityNet
Contains· 2024-02-11
15.7
SOTA
Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy arXiv:2402.07270
Visual Question Answering (VQA)onActivityNet
ExactMatch· 2024-02-11
7.07
SOTA
Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy arXiv:2402.07270
Visual Question Answering (VQA)onActivityNet
Follow-up ClipMatch@1· 2024-02-11
62.02
SOTA
Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy arXiv:2402.07270
Visual Question Answering (VQA)onActivityNet
Follow-up ClipMatch@5· 2024-02-11
75.13
SOTA
Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy arXiv:2402.07270
Visual Question Answering (VQA)onActivityNet
Follow-up Contains· 2024-02-11
18.09
SOTA
Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy arXiv:2402.07270
Visual Question Answering (VQA)onActivityNet
Follow-up ExactMatch· 2024-02-11
8.84
SOTA
Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy arXiv:2402.07270