TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Code/Chart Question Answering/ChartQA

Chart Question Answering on ChartQA

Metric: 1:1 Accuracy (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕1:1 Accuracy▼Extra DataPaperDate↕Code
1ChartPaLI-5B + PaLM 2-S81.3YesChart-based Reasoning: Transferring Capabilities...2024-03-19-
2Gemini Ultra80.8NoGemini: A Family of Highly Capable Multimodal Mo...2023-12-19Code
3DePlot+FlanPaLM+Codex (PoT Self-Consistency)79.3NoDePlot: One-shot visual language reasoning by pl...2022-12-20Code
4ChartPaLI-5B77.3YesChart-based Reasoning: Transferring Capabilities...2024-03-19-
5DePlot+Codex (PoT Self-Consistency)76.7NoDePlot: One-shot visual language reasoning by pl...2022-12-20Code
6ScreenAI 5B (4.62 B params, w/ OCR)76.7YesScreenAI: A Vision-Language Model for UI and Inf...2024-02-07Code
7SMoLA-PaLI-X Specialist Model74.6YesOmni-SMoLA: Boosting Generalist Multimodal Model...2023-12-01-
8SMoLA-PaLI-X Generalist Model73.8YesOmni-SMoLA: Boosting Generalist Multimodal Model...2023-12-01-
9MatCha4096 + LaMenDa72.64Yes---
10PaLI-X (Single-task FT w/ OCR)72.3YesPaLI-X: On Scaling up a Multilingual Vision and ...2023-05-29Code
11PaLI-X (Single-task FT)70.9YesPaLI-X: On Scaling up a Multilingual Vision and ...2023-05-29Code
12PaLI-X (Multi-task FT)70.6YesPaLI-X: On Scaling up a Multilingual Vision and ...2023-05-29Code
13DePlot+FlanPaLM (Self-Consistency)70.5NoDePlot: One-shot visual language reasoning by pl...2022-12-20Code
14PaLI-370NoPaLI-3 Vision Language Models: Smaller, Faster, ...2023-10-13Code
15PaLI-3 (w/ OCR)69.5NoPaLI-3 Vision Language Models: Smaller, Faster, ...2023-10-13Code
16DePlot+FlanPaLM (CoT)67.3NoDePlot: One-shot visual language reasoning by pl...2022-12-20Code
17Qwen-VL-Chat66.3YesQwen-VL: A Versatile Vision-Language Model for U...2023-08-24Code
18UniChart66.24YesUniChart: A Universal Vision-language Pretrained...2023-05-24Code
19Qwen-VL65.7YesQwen-VL: A Versatile Vision-Language Model for U...2023-08-24Code
20StructChart+GPT3.5 (STR ChartQA+SimChart9K)65.3YesStructChart: On the Schema, Metric, and Augmenta...2023-09-20Code
21MatCha64.2NoMatCha: Enhancing Visual Language Pretraining wi...2022-12-19Code
22StructChart+GPT3.5 (STR)60.7NoStructChart: On the Schema, Metric, and Augmenta...2023-09-20Code
23Pix2Struct-large58.6NoPix2Struct: Screenshot Parsing as Pretraining fo...2022-10-07Code
24Pix2Struct-base56NoPix2Struct: Screenshot Parsing as Pretraining fo...2022-10-07Code
25VisionTapas-OCR45.5NoChartQA: A Benchmark for Question Answering abou...2022-03-19Code
26DePlot+GPT3 (Self-Consistency)42.3NoDePlot: One-shot visual language reasoning by pl...2022-12-20Code
27DePlot+GPT3 (CoT)36.9NoDePlot: One-shot visual language reasoning by pl...2022-12-20Code