TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Visual Question Answering

Visual Question Answering

61 benchmarks2177 papers

MLLM Leaderboard

Benchmarks

Visual Question Answering on MM-Vet

GPT-4 scoreParams

Visual Question Answering on ViP-Bench

GPT-4 score (bbox)GPT-4 score (human)

Visual Question Answering on VQA v2 test-dev

Accuracy

Visual Question Answering on BenchLMM

GPT-3.5 score

Visual Question Answering on 6-DoF SpatialBench

TotalPosition-relPosition-absOrientation-relOrientation-abs

Visual Question Answering on SME

BLEU-4METEORROUGE-LCIDErSPICEDetectionACC#Learning Samples (N)

Visual Question Answering on EmbSpatial-Bench

Generation

Visual Question Answering on GQA-REX

BLEU-4CIDErGQA-testGQA-valGroundingMETEORROUGE-LSPICE

Visual Question Answering on MMBench

GPT-3.5 score

Visual Question Answering on V*bench

Accuracy

Visual Question Answering on VQA v2 val

Accuracy

Visual Question Answering on MSRVTT-QA

AccuracyTest Accuracy

Visual Question Answering on MMHal-Bench

Hallucination RateScore

Visual Question Answering on MSVD-QA

Accuracy

Visual Question Answering on PlotQA-D1

1:1 Accuracy

Visual Question Answering on PlotQA-D2

1:1 Accuracy

Visual Question Answering on VQA v2

Accuracy

Visual Question Answering on VQA v2 test-std

overallAccuracynumberotheryes/no

Visual Question Answering on AID-VQA

Acc. (test)

Visual Question Answering on AMBER

AccuracyF1

Visual Question Answering on CLEVR

Accuracy

Visual Question Answering on COCO Visual Question Answering (VQA) real images 2.0 open ended

Percentage correct

Visual Question Answering on EarthVQA

Overall Accuracy

Visual Question Answering on GQA

Accuracy

Visual Question Answering on GRIT

VQA (ablation)

Visual Question Answering on MapEval-Visual

Accuracy (% )

Visual Question Answering on RSVQA-HR

zero-shot Acc

Visual Question Answering on SIRI-WHU

Acc. (test)

Visual Question Answering on TextVQA test-standard

overall

Visual Question Answering on VisualMRC

CIDEr

Visual Question Answering on VizWiz

Accuracy

Visual Question Answering on MM-Vet (w/o External Tools)

GPT-4 score

Visual Question Answering on MM-Vet v2

GPT-4 scoreParams