Visual Question Answering (VQA) on ImageNet

Metric: ClipMatch@5 (higher is better)

LeaderboardDataset
Loading chart...
#ModelClipMatch@5Extra DataPaperDateCode
1BLIP-2 OPT77.24NoOpen-ended VQA benchmarking of Vision-Language m...2024-02-11Code