Visual Question Answering (VQA) on ImageNet
Metric: ClipMatch@5 (higher is better)
LeaderboardDataset
Loading chart...
Results
Submit a result| # | Model↕ | ClipMatch@5▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | BLIP-2 OPT | 77.24 | No | Open-ended VQA benchmarking of Vision-Language m... | 2024-02-11 | Code |