Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Natural Language Processing
/
Visual Question Answering
Visual Question Answering
61 benchmarks
2177 papers
MLLM Leaderboard
Benchmarks
Visual Question Answering on
MM-Vet
GPT-4 score
Params
Visual Question Answering on
ViP-Bench
GPT-4 score (bbox)
GPT-4 score (human)
Visual Question Answering on
VQA v2 test-dev
Accuracy
Visual Question Answering on
BenchLMM
GPT-3.5 score
Visual Question Answering on
6-DoF SpatialBench
Total
Position-rel
Position-abs
Orientation-rel
Orientation-abs
Visual Question Answering on
SME
BLEU-4
METEOR
ROUGE-L
CIDEr
SPICE
Detection
ACC
#Learning Samples (N)
Visual Question Answering on
EmbSpatial-Bench
Generation
Visual Question Answering on
GQA-REX
BLEU-4
CIDEr
GQA-test
GQA-val
Grounding
METEOR
ROUGE-L
SPICE
Visual Question Answering on
MMBench
GPT-3.5 score
Visual Question Answering on
V*bench
Accuracy
Visual Question Answering on
VQA v2 val
Accuracy
Visual Question Answering on
MSRVTT-QA
Accuracy
Test Accuracy
Visual Question Answering on
MMHal-Bench
Hallucination Rate
Score
Visual Question Answering on
MSVD-QA
Accuracy
Visual Question Answering on
PlotQA-D1
1:1 Accuracy
Visual Question Answering on
PlotQA-D2
1:1 Accuracy
Visual Question Answering on
VQA v2
Accuracy
Visual Question Answering on
VQA v2 test-std
overall
Accuracy
number
other
yes/no
Visual Question Answering on
AID-VQA
Acc. (test)
Visual Question Answering on
AMBER
Accuracy
F1
Visual Question Answering on
CLEVR
Accuracy
Visual Question Answering on
COCO Visual Question Answering (VQA) real images 2.0 open ended
Percentage correct
Visual Question Answering on
EarthVQA
Overall Accuracy
Visual Question Answering on
GQA
Accuracy
Visual Question Answering on
GRIT
VQA (ablation)
Visual Question Answering on
MapEval-Visual
Accuracy (% )
Visual Question Answering on
RSVQA-HR
zero-shot Acc
Visual Question Answering on
SIRI-WHU
Acc. (test)
Visual Question Answering on
TextVQA test-standard
overall
Visual Question Answering on
VisualMRC
CIDEr
Visual Question Answering on
VizWiz
Accuracy
Visual Question Answering on
MM-Vet (w/o External Tools)
GPT-4 score
Visual Question Answering on
MM-Vet v2
GPT-4 score
Params