ChartBench: A Benchmark for Complex Visual Reasoning in Charts (2312.15915v3)
Abstract: Multimodal Large Language Models (MLLMs) have shown impressive capabilities in image understanding and generation. However, current benchmarks fail to accurately evaluate the chart comprehension of MLLMs due to limited chart types and inappropriate metrics. To address this, we propose ChartBench, a comprehensive benchmark designed to assess chart comprehension and data reliability through complex visual reasoning. ChartBench includes 42 categories, 66.6k charts, and 600k question-answer pairs. Notably, many of the charts lack data-point annotations, so MLLMs must derive values the way humans do, by leveraging inherent chart elements such as color, legends, and coordinate systems. We also design an enhanced evaluation metric, Acc+, that evaluates MLLMs without extensive manual effort or costly LLM-based judging. Furthermore, we propose two baselines, based on chain-of-thought prompting and supervised fine-tuning, to improve model performance on unannotated charts. Extensive evaluations of 18 open-source and 3 proprietary MLLMs reveal their limitations in chart comprehension and offer valuable insights for further research. Code and dataset are publicly available at https://chartbench.github.io.
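The abstract only sketches the Acc+ metric, so the snippet below is a minimal illustration, assuming (as in judgment-style evaluations of this kind) that each chart is paired with several yes/no assertions, e.g. one stating the true value and one stating a perturbed value, and that a chart counts as correct only when the model judges every paired assertion correctly. The function name `acc_plus` and the `results` structure are illustrative, not the released evaluation code.

```python
from typing import Dict, List

def acc_plus(results: Dict[str, List[bool]]) -> float:
    """Compute a strict Acc+-style score over judgment QA results.

    `results` maps each chart ID to the per-question correctness of its
    paired yes/no judgments (assumed here: one true and one perturbed
    assertion per chart). A chart counts as correct only when *all* of
    its paired judgments are answered correctly.
    """
    if not results:
        return 0.0
    strict_hits = sum(1 for answers in results.values() if answers and all(answers))
    return strict_hits / len(results)

# Example: two charts; only the first has both paired judgments correct.
score = acc_plus({"chart_001": [True, True], "chart_002": [True, False]})
print(f"Acc+ = {score:.2f}")  # 0.50
```

Because random guessing can score well on isolated yes/no questions, requiring joint correctness over the paired assertions penalizes models that cannot actually read values off the chart, which is the motivation the abstract gives for the metric.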
Authors: Zhengzhuo Xu, Sinan Du, Yiyan Qi, Chengjin Xu, Chun Yuan, Jian Guo