Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue (2212.03398v1)

Published 7 Dec 2022 in eess.AS, cs.CL, and cs.SD

Abstract: Entrainment is the phenomenon by which an interlocutor adapts their speaking style to align with their partner in conversations. It has been found in different dimensions as acoustic, prosodic, lexical or syntactic. In this work, we explore and utilize the entrainment phenomenon to improve spoken dialogue systems for voice assistants. We first examine the existence of the entrainment phenomenon in human-to-human dialogues in respect to acoustic feature and then extend the analysis to emotion features. The analysis results show strong evidence of entrainment in terms of both acoustic and emotion features. Based on this findings, we implement two entrainment policies and assess if the integration of entrainment principle into a Text-to-Speech (TTS) system improves the synthesis performance and the user experience. It is found that the integration of the entrainment principle into a TTS system brings performance improvement when considering acoustic features, while no obvious improvement is observed when considering emotion features.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Daxin Tan (13 papers)
  2. Nikos Kargas (11 papers)
  3. David McHardy (2 papers)
  4. Constantinos Papayiannis (6 papers)
  5. Antonio Bonafonte (14 papers)
  6. Marek Strelec (1 paper)
  7. Jonas Rohnke (5 papers)
  8. Agis Oikonomou Filandras (1 paper)
  9. Trevor Wood (6 papers)

Summary

We haven't generated a summary for this paper yet.