Visual Question Answering (VQA) on ImageNet

Metric: Follow-up ClipMatch@5 (higher is better)

LeaderboardDataset
Loading chart...
#ModelFollow-up ClipMatch@5Extra DataPaperDateCode
1BLIP-2 OPT83.54NoOpen-ended VQA benchmarking of Vision-Language m...2024-02-11Code