
User Interaction Patterns and Breakdowns in Conversing with LLM-Powered Voice Assistants (2309.13879v2)

Published 25 Sep 2023 in cs.HC

Abstract: Conventional Voice Assistants (VAs) rely on traditional language models to discern user intent and respond to queries, leading to interactions that often lack broader contextual understanding, an area in which Large Language Models (LLMs) excel. However, current LLMs are largely designed for text-based interactions, making it unclear how user interactions will evolve when the modality changes to voice. In this work, we investigate whether LLMs can enrich VA interactions via an exploratory study with participants (N=20) using a ChatGPT-powered VA for three scenarios (medical self-diagnosis, creative planning, and discussion) with varied constraints, stakes, and objectivity. We observe that the LLM-powered VA elicits richer interaction patterns that vary across tasks, showing its versatility. Notably, LLMs absorb the majority of VA intent recognition failures. We additionally discuss the potential of harnessing LLMs for more resilient and fluid user-VA interactions and provide design guidelines for tailoring LLMs for voice assistance.

Authors (5)
  1. Amama Mahmood (9 papers)
  2. Junxiang Wang (35 papers)
  3. Bingsheng Yao (49 papers)
  4. Dakuo Wang (87 papers)
  5. Chien-Ming Huang (31 papers)
Citations (4)