Multimodal Reasoning on AlgoPuzzleVQA
Metric: Accuracy (Acc; higher is better)
Results
| # | Model | Acc | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | GPT-4 | 30.3 | No | Are Language Models Puzzle Prodigies? Algorithmi... | 2024-03-06 | Code |