Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction (2406.12707v1)

Published 18 Jun 2024 in cs.CL, cs.AI, cs.SD, and eess.AS

Abstract: LLM-enhanced agents become increasingly prevalent in Human-AI communication, offering vast potential from entertainment to professional domains. However, current multi-modal dialogue systems overlook the acoustic information present in speech, which is crucial for understanding human communication nuances. This oversight can lead to misinterpretations of speakers' intentions, resulting in inconsistent or even contradictory responses within dialogues. To bridge this gap, in this paper, we propose PerceptiveAgent, an empathetic multi-modal dialogue system designed to discern deeper or more subtle meanings beyond the literal interpretations of words through the integration of speech modality perception. Employing LLMs as a cognitive core, PerceptiveAgent perceives acoustic information from input speech and generates empathetic responses based on speaking styles described in natural language. Experimental results indicate that PerceptiveAgent excels in contextual understanding by accurately discerning the speakers' true intentions in scenarios where the linguistic meaning is either contrary to or inconsistent with the speaker's true feelings, producing more nuanced and expressive spoken dialogues. Code is publicly available at: \url{https://github.com/Haoqiu-Yan/PerceptiveAgent}.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Haoqiu Yan (1 paper)
  2. Yongxin Zhu (16 papers)
  3. Kai Zheng (134 papers)
  4. Bing Liu (212 papers)
  5. Haoyu Cao (12 papers)
  6. Deqiang Jiang (20 papers)
  7. Linli Xu (33 papers)

Summary

We haven't generated a summary for this paper yet.