Multimodal Reasoning on AlgoPuzzleVQA

Metric: Acc (higher is better)

LeaderboardDataset
Loading chart...
#ModelAccExtra DataPaperDateCode
1GPT-430.3NoAre Language Models Puzzle Prodigies? Algorithmi...2024-03-06Code