Visual Question Answering (VQA) on ImageNet

Metric: ClipMatch@1 (higher is better)

LeaderboardDataset
Loading chart...
#ModelClipMatch@1Extra DataPaperDateCode
1BLIP-2 OPT57.1NoOpen-ended VQA benchmarking of Vision-Language m...2024-02-11Code