Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Evaluating the Semantic Profiling Abilities of LLMs for Natural Language Utterances in Data Visualization (2407.06129v2)

Published 8 Jul 2024 in cs.AI and cs.HC

Abstract: Automatically generating data visualizations in response to human utterances on datasets necessitates a deep semantic understanding of the data utterance, including implicit and explicit references to data attributes, visualization tasks, and necessary data preparation steps. Natural Language Interfaces (NLIs) for data visualization have explored ways to infer such information, yet challenges persist due to inherent uncertainty in human speech. Recent advances in LLMs provide an avenue to address these challenges, but their ability to extract the relevant semantic information remains unexplored. In this study, we evaluate four publicly available LLMs (GPT-4, Gemini-Pro, Llama3, and Mixtral), investigating their ability to comprehend utterances even in the presence of uncertainty and identify the relevant data context and visual tasks. Our findings reveal that LLMs are sensitive to uncertainties in utterances. Despite this sensitivity, they are able to extract the relevant data context. However, LLMs struggle with inferring visualization tasks. Based on these results, we highlight future research directions on using LLMs for visualization generation.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Hannah K. Bako (3 papers)
  2. Xinyi Liu (58 papers)
  3. Kwesi A. Cobbina (1 paper)
  4. Zhicheng Liu (41 papers)
  5. Arshnoor Bhutani (1 paper)
X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets