Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
144 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Speech as Interactive Design Material (SIDM): How to design and evaluate task-tailored synthetic voices? (2402.16592v1)

Published 26 Feb 2024 in cs.HC

Abstract: The aim of this workshop is two-fold. First, it aims to establish a research community focused on design and evaluation of synthetic speech (TTS) interfaces that are tailored not only to goal oriented tasks (e.g., food ordering, online shopping) but also personal growth and resilience promoting applications (e.g., coaching, mindful reflection, and tutoring). Second, through discussion and collaborative efforts, to establish a set of practices and standards that will help to improve ecological validity of TTS evaluation. In particular, the workshop will explore the topics such as: interaction design of voice-based conversational interfaces; the interplay between prosodic aspects (e.g., pitch variance, loudness, jitter) of TTS and its impact on voice perception. This workshop will serve as a platform on which to build a community that is better equipped to tackle the dynamic field of interactive TTS interfaces, which remains understudied, yet increasingly pertinent to everyday lives of users.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (10)
  1. The sound of trustworthiness: Acoustic-based modulation of perceived voice personality. PloS one 12, 10 (2017), e0185651.
  2. Julia Cambre and Chinmay Kulkarni. 2019. One voice fits all? Social implications and research challenges of designing voices for smart devices. Proceedings of the ACM on Human-Computer Interaction 3, CSCW (2019), 1–19.
  3. Designing persuasive robots: how robots might persuade people using vocal and nonverbal cues. In Proceedings of the seventh annual ACM/IEEE international conference on Human-Robot Interaction. 293–300.
  4. The state of speech in HCI: Trends, themes and challenges. Interacting with computers 31, 4 (2019), 349–371.
  5. Conversational Agents Trust Calibration: A User-Centred Perspective to Design. In Proceedings of the 4th Conference on Conversational User Interfaces. 1–6.
  6. Persuasive synthetic speech: Voice perception and user behaviour. In Proceedings of the 2nd Conference on Conversational User Interfaces. 1–9.
  7. Aaron C Elkins and Douglas C Derrick. 2013. The sound of trust: voice as a measurement of trust during interactions with embodied conversational agents. Group decision and negotiation 22, 5 (2013), 897–913.
  8. Deep voice 2: Multi-speaker neural text-to-speech. Advances in neural information processing systems 30 (2017).
  9. How do you say ‘Hello’? Personality impressions from brief novel voices. PloS one 9, 3 (2014), e90779.
  10. Voice as a contemporary frontier of interaction design. In European Conference on Information Systems (ECIS).-Virtual.

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets