TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/ChartQA: A Benchmark for Question Answering about Charts w...

ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical Reasoning

Ahmed Masry, Do Xuan Long, Jia Qing Tan, Shafiq Joty, Enamul Hoque

2022-03-19Findings (ACL) 2022 5Question AnsweringChart Question AnsweringLogical Reasoning
PaperPDFCode(official)

Abstract

Charts are very popular for analyzing data. When exploring charts, people often ask a variety of complex reasoning questions that involve several logical and arithmetic operations. They also commonly refer to visual features of a chart in their questions. However, most existing datasets do not focus on such complex reasoning questions as their questions are template-based and answers come from a fixed-vocabulary. In this work, we present a large-scale benchmark covering 9.6K human-written questions as well as 23.1K questions generated from human-written chart summaries. To address the unique challenges in our benchmark involving visual and logical reasoning over charts, we present two transformer-based models that combine visual features and the data table of the chart in a unified way to answer questions. While our models achieve the state-of-the-art results on the previous datasets as well as on our benchmark, the evaluation also reveals several challenges in answering complex reasoning questions.

Results

TaskDatasetMetricValueModel
Visual Question Answering (VQA)PlotQA1:1 Accuracy66VL-T5-OCR
Visual Question Answering (VQA)PlotQA1:1 Accuracy53.9VisionTapas-OCR
Visual Question Answering (VQA)RealCQA1:1 Accuracy0.178733575026565crct - baseline
Visual Question Answering (VQA)ChartQA1:1 Accuracy45.5VisionTapas-OCR
Chart Question AnsweringPlotQA1:1 Accuracy66VL-T5-OCR
Chart Question AnsweringPlotQA1:1 Accuracy53.9VisionTapas-OCR
Chart Question AnsweringRealCQA1:1 Accuracy0.178733575026565crct - baseline
Chart Question AnsweringChartQA1:1 Accuracy45.5VisionTapas-OCR

Related Papers

From Roots to Rewards: Dynamic Tree Reasoning with RL2025-07-17Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering2025-07-17Vision-and-Language Training Helps Deploy Taxonomic Knowledge but Does Not Fundamentally Alter It2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17Describe Anything Model for Visual Question Answering on Text-rich Images2025-07-16Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility2025-07-16Warehouse Spatial Question Answering with LLM Agent2025-07-14Evaluating Attribute Confusion in Fashion Text-to-Image Generation2025-07-09