Are LLMs ready for Visualization? (2403.06158v1)
Abstract: Generative models have received a lot of attention in many areas of academia and the industry. Their capabilities span many areas, from the invention of images given a prompt to the generation of concrete code to solve a certain programming issue. These two paradigmatic cases fall within two distinct categories of requirements, ranging from "creativity" to "precision", as characterized by Bing Chat, which employs ChatGPT-4 as its backbone. Visualization practitioners and researchers have wondered to what end one of such systems could accomplish our work in a more efficient way. Several works in the literature have utilized them for the creation of visualizations. And some tools such as Lida, incorporate them as part of their pipeline. Nevertheless, to the authors' knowledge, no systematic approach for testing their capabilities has been published, which includes both extensive and in-depth evaluation. Our goal is to fill that gap with a systematic approach that analyzes three elements: whether LLMs are capable of correctly generating a large variety of charts, what libraries they can deal with effectively, and how far we can go to configure individual charts. To achieve this objective, we initially selected a diverse set of charts, which are commonly utilized in data visualization. We then developed a set of generic prompts that could be used to generate them, and analyzed the performance of different LLMs and libraries. The results include both the set of prompts and the data sources, as well as an analysis of the performance with different configurations.
- J. Bertin. Sémiologie graphique: Les diagrammes-les réseaux-les cartes. Technical report, Gauthier-VillarsMouton & Cie, 1973.
- Type-directed synthesis of visualizations from natural language queries. Proceedings of the ACM on Programming Languages, 6(OOPSLA2):532–559, 2022.
- A multi-modal natural language interface to an information visualization environment. International Journal of Speech Technology, 4:297–314, 2001.
- Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805, 2018.
- V. C. Dibia. Lida: A tool for automatic generation of grammar-agnostic visualizations and infographics using large language models. Annual Meeting of the Association for Computational Linguistics, 2023. doi: 10 . 48550/arxiv . 2303 . 02927
- L. Floridi and M. Chiriatti. Gpt-3: Its nature, scope, limits, and consequences. Minds and Machines, 30:681–694, 2020.
- Datatone: Managing ambiguity in natural language interfaces for data visualization. In Proceedings of the 28th annual acm symposium on user interface software & technology, pp. 489–500, 2015.
- How good is chatgpt in giving advice on your visualization design? arXiv preprint arXiv:2310.09617, 2023.
- Beyond mouse and keyboard: Expanding design considerations for information visualization interactions. IEEE Transactions on Visualization and Computer Graphics, 18(12):2689–2698, 2012.
- P. Maddigan and T. Susnjak. Chat2vis: Generating data visualizations via natural language using chatgpt, codex and gpt-3 large language models. IEEE Access, 2023. doi: 10 . 1109/access . 2023 . 3274199
- Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781, 2013.
- T. Munzner. Visualization analysis and design. CRC press, 2014.
- Nl4dv: A toolkit for generating analytic specifications for data visualization from natural language queries. IEEE Transactions on Visualization and Computer Graphics, 27(2):369–379, 2020.
- Gpt-4 technical report, 2023.
- Deepeye: An automatic big data visualization framework. null, 2018. doi: 10 . 26599/bdma . 2018 . 9020007
- Intelligent visualization system for big multi-source medical data based on data lake. null, 2021. doi: 10 . 1007/978-3-030-87571-8_61
- Code llama: Open foundation models for code. arXiv preprint arXiv:2308.12950, 2023.
- Vega-lite: A grammar of interactive graphics. IEEE transactions on visualization and computer graphics, 23(1):341–350, 2016.
- Eviza: A natural language interface for visual analysis. In Proceedings of the 29th annual symposium on user interface software and technology, pp. 365–377, 2016.
- Towards natural language interfaces for data visualization: A survey. IEEE Transactions on Visualization and Computer Graphics, 2022. doi: 10 . 1109/tvcg . 2022 . 3148007
- C. Sievert. Interactive web-based data visualization with R, plotly, and shiny. CRC Press, 2020.
- N. Sultanum and A. Srinivasan. Datatales: Investigating the use of large language models for authoring data-driven articles. arXiv.org, 2023. doi: 10 . 48550/arxiv . 2308 . 04076
- Altair: interactive statistical visualizations for python. Journal of open source software, 3(32):1057, 2018.
- Challenges in designing natural language interfaces for complex visual models. In Proceedings of the First Workshop on Bridging Human–Computer Interaction and Natural Language Processing, pp. 66–73, 2021.
- Towards natural language-based visualization authoring. IEEE Transactions on Visualization and Computer Graphics, 2022. doi: 10 . 1109/tvcg . 2022 . 3209357
- C. O. Wilke. Fundamentals of data visualization: a primer on making informative and compelling figures. O’Reilly Media, 2019.
- Ai4vis: Survey on artificial intelligence approaches for data visualization. arXiv: Human-Computer Interaction, 2021.
- Pere-Pau Vázquez (6 papers)