Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

InferEM: Inferring the Speaker's Intention for Empathetic Dialogue Generation (2212.06373v7)

Published 13 Dec 2022 in cs.CL and cs.HC

Abstract: Current approaches to empathetic response generation typically encode the entire dialogue history directly and put the output into a decoder to generate friendly feedback. These methods focus on modelling contextual information but neglect capturing the direct intention of the speaker. We argue that the last utterance in the dialogue empirically conveys the intention of the speaker. Consequently, we propose a novel model named InferEM for empathetic response generation. We separately encode the last utterance and fuse it with the entire dialogue through the multi-head attention based intention fusion module to capture the speaker's intention. Besides, we utilize previous utterances to predict the last utterance, which simulates human's psychology to guess what the interlocutor may speak in advance. To balance the optimizing rates of the utterance prediction and response generation, a multi-task learning strategy is designed for InferEM. Experimental results demonstrate the plausibility and validity of InferEM in improving empathetic expression.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (29)
  1. (1997). Empathy and attitudes: Can feeling for a member of a stigmatized group improve feelings toward the group? Journal of Personality And Social Psychology, 72(1), 105–118. doi: 10.1037/0022-3514.72.1.105
  2. (1997). Empathy reconsidered: New directions in psychotherapy (A. C. Bohart  L. S. Greenberg, Eds.). American Psychological Association. doi: 10.1080/10503309912331332721
  3. (2022). Emphi: Generating empathetic responses with human-like intents. In Proceedings of the conference of the north american chapter of the association for computational linguistics: Human language technologies (pp. 1063–1074). Seattle, United States: Association for Computational Linguistics. doi: 10.18653/v1/2022.naacl-main.78
  4. Davis, M. H.  (1983). Measuring individual differences in empathy: Evidence for a multidimensional approach. Journal of Personality and Social Psychology, 44(1), 113–126. doi: 10.1037/0022-3514.44.1.113
  5. (2021). Improving empathetic response generation by recognizing emotion cause in conversations. In Findings of the association for computational linguistics: Emnlp 2021 (pp. 807–819). doi: 10.18653/v1/2021.findings-emnlp.70
  6. (2009). Joint action, interactive alignment, and dialog. Topics in Cognitive Science, 1(2), 292–304. doi: 10.1111/j.1756-8765.2009.01020.x
  7. (2021). Mmgcn: Multimodal fusion via deep graph convolution network for emotion recognition in conversation. In Proceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing (p. 5666-5675). Association for Computational Linguistics. doi: 10.18653/v1/2021.acl-long.440
  8. (2019). Improving neural response diversity with frequency-aware cross-entropy loss. In The world wide web conference (pp. 2879–2885). doi: 10.1145/3308558.3313415
  9. (2015). Adam: A method for stochastic optimization. In International conference on learning representations (pp. 1–13).
  10. (2015). A diversity-promoting objective function for neural conversation models. In Proceedings of the 2016 conference of the north american chapter of the association for computational linguistics: Human language technologies (pp. 110–119). doi: 10.18653/v1/n16-1014
  11. (2023). Graphcfc: A directed graph based cross-modal feature complementation approach for multimodal conversational emotion recognition. IEEE Transactions on Multimedia, 1-13. doi: 10.1109/TMM.2023.3260635
  12. (2022). Knowledge bridging for empathetic dialogue generation. In Proceedings of the aaai conference on artificial intelligence. doi: 10.1609/aaai.v36i10.21347
  13. (2019). Moel: Mixture of empathetic listeners. In Proceedings of the conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (emnlp-ijcnlp) (pp. 121–132). doi: 10.18653/v1/d19-1012
  14. (2021). Towards emotional support dialog systems. In Proceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing (volume 1: Long papers). doi: 10.18653/v1/2021.acl-long.269
  15. (2020). Mime: Mimicking emotions for empathetic response generation. In Proceedings of the 2020 conference on empirical methods in natural language processing (emnlp). doi: 10.18653/v1/2020.emnlp-main.721
  16. (2012). “i help because i want to, not because you tell me to”: Empathy increases autonomously motivated helping. Personality and Social Psychology Bulletin, 38(5), 681–689. doi: 10.1177/0146167211435940
  17. (2014). Glove: Global vectors for word representation. In Proceedings of the 2014 conference on empirical methods in natural language processing (emnlp) (pp. 1532–1543). doi: 10.3115/v1/d14-1162
  18. (2013). Giving to others and the association between stress and mortality. American Journal of Public Health, 103(9), 1649–1655. doi: 10.2105/AJPH.2012.300876
  19. (2018). Towards empathetic open-domain conversation models: A new benchmark and dataset. In Proceedings of the 57th annual meeting of the association for computational linguistics (pp. 5370–5381). doi: 10.18653/v1/p19-1534
  20. (2022). Cem: Commonsense-aware empathetic response generation. In Proceedings of the aaai conference on artificial intelligence (Vol. 36, pp. 11229–11237). doi: 10.1609/aaai.v36i10.21373
  21. (2015). Hierarchical neural network generative models for movie dialogues. , 7(8), 434–441.
  22. (2023). Human-ai collaboration enables more empathic conversations in text-based peer-to-peer mental health support. Nature Machine Intelligence, 1–12. doi: 10.1038/s42256-022-00593-2
  23. (2017). Conceptnet 5.5: An open multilingual graph of general knowledge. In Proceedings of the aaai conference on artificial intelligence. doi: 10.1609/aaai.v31i1.11164
  24. (2017). Attention is all you need. In Advances in neural information processing systems (Vol. 30).
  25. (2018). Graph attention networks. In International conference on learning representations (pp. 1–12).
  26. (2019). Transfertransfo: A transfer learning approach for neural network based conversational agents. In Proceedings of the conference on neural information processing systems.
  27. (2018). Modeling multi-turn conversation with deep utterance aggregation. In Proceedings of the 27th international conference on computational linguistics (pp. 3740–3752).
  28. (2019). Knowledge-enriched transformer for emotion detection in textual conversations. In Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (emnlp-ijcnlp) (pp. 165–176). doi: 10.18653/v1/d19-1016
  29. (2020). The design and implementation of xiaoice, an empathetic social chatbot. Computational Linguistics, 46(1), 53–93. doi: 10.1162/coli˙a˙00368
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Guoqing Lv (4 papers)
  2. Jiang Li (48 papers)
  3. Xiaoping Wang (56 papers)
  4. Zhigang Zeng (28 papers)
Citations (1)