Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
60 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
8 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Large Language Models are Capable of Offering Cognitive Reappraisal, if Guided (2404.01288v2)

Published 1 Apr 2024 in cs.CL
Large Language Models are Capable of Offering Cognitive Reappraisal, if Guided

Abstract: LLMs have offered new opportunities for emotional support, and recent work has shown that they can produce empathic responses to people in distress. However, long-term mental well-being requires emotional self-regulation, where a one-time empathic response falls short. This work takes a first step by engaging with cognitive reappraisals, a strategy from psychology practitioners that uses language to targetedly change negative appraisals that an individual makes of the situation; such appraisals is known to sit at the root of human emotional experience. We hypothesize that psychologically grounded principles could enable such advanced psychology capabilities in LLMs, and design RESORT which consists of a series of reappraisal constitutions across multiple dimensions that can be used as LLM instructions. We conduct a first-of-its-kind expert evaluation (by clinical psychologists with M.S. or Ph.D. degrees) of an LLM's zero-shot ability to generate cognitive reappraisal responses to medium-length social media messages asking for support. This fine-grained evaluation showed that even LLMs at the 7B scale guided by RESORT are capable of generating empathic responses that can help users reappraise their situations.

LLMs Elicit Advanced Cognitive Reappraisal Capacities with Expert-Guided Framework

Introduction

The evolution of LLMs into domains requiring complex emotional and psychological understanding presents both challenges and opportunities within artificial intelligence research. Recognizing their potential for providing emotional support, this paper explores the field of cognitive reappraisals—a psychological technique pivotal for emotional self-regulation. By innovatively combining the fields of psychology and artificial intelligence, the research investigates the potential of LLMs to support mental well-being beyond mere empathic responses. Focusing on the structured psychological framework , this paper methodically evaluates LLMs' capabilities in generating cognitive reappraisals advised by clinical psychologists, marking a significant stride towards integrating psychological precision within generative AI models.

Cognitive Reappraisal and LLMs

At the heart of emotional experience lie cognitive appraisals—individual interpretations of situations that shape emotional responses. Psychological interventions often target these appraisals to mitigate distress, emphasizing the strategic alteration of these interpretations, known as cognitive reappraisals. Recognizing the scarcity of accessible professional support, this paper introduces , a compilation of six reappraisal constitutions informed by psychological theory, aiming to guide LLMs in generating context-specific reappraisals. The experiment extends across various dimensions of cognitive appraisals, implementing both individual reappraisal generation and iterative refinement strategies, exploring LLMs' potential to apply complex psychological principles in providing support.

Methodology

Employing a rigorous methodological approach, the research leverages posts from social media platforms as real-life scenarios requiring emotional support. The constitution, structured to address six fundamental dimensions of cognitive appraisals, serves as a directive for LLMs to navigate the intricacies of human emotional experiences. The paper meticulously evaluates LLM-generated responses through the lens of clinical psychologists, providing a nuanced understanding of the alignment with psychological theories, perceived empathy, harmfulness, and factuality. This comprehensive evaluation framework not only scrutinizes the technical effectiveness of LLMs but also their pragmatic utility in real-world emotional support contexts.

Findings

The findings unveil a compelling capability of LLMs, even those of modest 7B scale, to generate cognitive reappraisals that outperform not only baseline human-written responses but also the standard empathic reactions often found on social media platforms. The response quality notably improves under guidance from the framework, highlighting the critical role of expert-informed instructions in eliciting sophisticated psychological functionalities from LLMs. The paper’s analytical rigor is further exemplified by its extensive expert evaluation, which reveals a moderate to substantial agreement among clinical psychologists on the utility of LLM-generated reappraisals. Moreover, an exploration into automatic evaluation using GPT-4 as a meta-evaluator presents promising directions for future research methodologies.

Implications and Future Directions

This exploration into the intersection of psychology and artificial intelligence suggests profound implications for both fields. It underscores the potential of LLMs to transcend conventional applications, venturing into the domain of psychological support with remarkable adeptness. The success of the framework in guiding LLMs towards generating meaningful cognitive reappraisals paves the way for future research to further refine these models, enhancing their sensitivity and adherence to psychological principles. Moreover, this paper’s innovative approach to evaluating AI-generated psychological support opens new avenues for research methodologies, leveraging the capabilities of LLMs themselves in the analysis process.

In conclusion, this paper not only demonstrates the feasibility of integrating psychological insights into LLMs but also accentuates the importance of expert guidance in unlocking the advanced capabilities of AI in providing emotional support. As the field advances, the collaboration between artificial intelligence and clinical psychology holds the promise of accessible, informed, and empathetic support systems, augmenting human efforts in fostering mental well-being.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (83)
  1. Magda B Arnold. Emotion and personality. Columbia University Press, 1960.
  2. Survey article: Inter-coder agreement for computational linguistics. Computational Linguistics, 34(4):555–596, 2008. doi: 10.1162/coli.07-034-R2. URL https://aclanthology.org/J08-4004.
  3. Comparing physician and artificial intelligence chatbot responses to patient questions posted to a public social media forum. JAMA Internal Medicine, 2023.
  4. Constitutional ai: Harmlessness from ai feedback, 2022.
  5. A multitask, multilingual, multimodal evaluation of ChatGPT on reasoning, hallucination, and interactivity. In Jong C. Park, Yuki Arase, Baotian Hu, Wei Lu, Derry Wijaya, Ayu Purwarianti, and Adila Alfa Krisnadhi (eds.), Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pp.  675–718, Nusa Dua, Bali, November 2023. Association for Computational Linguistics. doi: 10.18653/v1/2023.ijcnlp-main.45. URL https://aclanthology.org/2023.ijcnlp-main.45.
  6. Aaron T Beck. Thinking and depression: I. idiosyncratic content and cognitive distortions. Archives of general psychiatry, 9(4):324–333, 1963.
  7. Aaron T Beck. Cognitive therapy and the emotional disorders. Penguin, 1979.
  8. Latent dirichlet allocation. Journal of Machine Learning Research, 3(Jan):993–1022, 2003.
  9. Cognitive Reappraisal of Emotion: A Meta-Analysis of Human Neuroimaging Studies. Cerebral Cortex, 24(11):2981–2990, 06 2013. ISSN 1047-3211. doi: 10.1093/cercor/bht154. URL https://doi.org/10.1093/cercor/bht154.
  10. Empathy is hard work: People choose to avoid empathy because of its cognitive costs. Journal of Experimental Psychology: General, 148(6):962, 2019.
  11. PAL: Persona-augmented emotional support conversation generation. In Anna Rogers, Jordan Boyd-Graber, and Naoaki Okazaki (eds.), Findings of the Association for Computational Linguistics: ACL 2023, pp.  535–554, Toronto, Canada, July 2023. Association for Computational Linguistics. doi: 10.18653/v1/2023.findings-acl.34. URL https://aclanthology.org/2023.findings-acl.34.
  12. Improving multi-turn emotional support dialogue generation with lookahead strategy planning. In Yoav Goldberg, Zornitsa Kozareva, and Yue Zhang (eds.), Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pp.  3014–3026, Abu Dhabi, United Arab Emirates, December 2022. Association for Computational Linguistics. doi: 10.18653/v1/2022.emnlp-main.195. URL https://aclanthology.org/2022.emnlp-main.195.
  13. Barriers to healthcare access among us adults with mental health challenges: a population-based study. SSM-population health, 15:100847, 2021.
  14. Emerging mental health issues during the covid-19 pandemic: An indian perspective. Indian journal of psychiatry, 62(Suppl 3):S354, 2020.
  15. Using large language models in psychology. Nature Reviews Psychology, October 2023. ISSN 2731-0574. doi: 10.1038/s44159-023-00241-5. URL https://doi.org/10.1038/s44159-023-00241-5.
  16. Jacquelynne S Eccles. Expectancies, values, and academic behaviors. In Achievement and achievement motives, pp.  75–146. Freeman, 1983.
  17. Appraisal processes in emotion. In R. J. Davidson, K. R. Scherer, and H. H. Goldsmith (eds.), Handbook of Affective Sciences, pp.  572–595. Oxford University Press, 2003.
  18. Relations among emotion, appraisal, and emotional action readiness. Journal of personality and social psychology, 57(2):212, 1989.
  19. Reappraisal, 2009.
  20. The dynamics of real-time classroom emotions: Appraisals mediate the relation between students’ perceptions of teaching and their emotions. Journal of Educational Psychology, 112(6):1243, 2020.
  21. The neural bases of emotion regulation: reappraisal and suppression of negative emotion. Biological psychiatry, 63(6):577–586, 2008.
  22. James J Gross. Antecedent-and response-focused emotion regulation: divergent consequences for experience, expression, and physiology. Journal of personality and social psychology, 74(1):224, 1998a.
  23. James J Gross. The emerging field of emotion regulation: An integrative review. Review of general psychology, 2(3):271–299, 1998b.
  24. Individual differences in two emotion regulation processes: implications for affect, relationships, and well-being. Journal of personality and social psychology, 85(2):348, 2003.
  25. In praise of empathic ai. Trends in Cognitive Sciences, 2023.
  26. Survey of hallucination in natural language generation. ACM Comput. Surv., 55(12), mar 2023. ISSN 0360-0300. doi: 10.1145/3571730. URL https://doi.org/10.1145/3571730.
  27. Mistral 7b, 2023.
  28. How to improve others’ emotions: Reappraise and be responsive. Affective Science, 4(2):233–247, 2023.
  29. Achievement goals and emotions in athletes: The mediating role of challenge and threat appraisals. Motivation and Emotion, 38:589–599, 2014.
  30. Leanne K Knobloch. Evaluating a contextual model of responses to relational uncertainty increasing events: The role of intimacy, appraisals, and emotions. Human Communication Research, 31(1):60–101, 2005.
  31. Corrigendum: Effects of an inquiry-based short intervention on state test anxiety in comparison to alternative coping strategies. Frontiers in psychology, 9:438157, 2019.
  32. Richard S Lazarus. Psychological stress and the coping process. McGraw-Hill, 1966.
  33. Richard S Lazarus. Progress on a cognitive-motivational-relational theory of emotion. American psychologist, 46(8):819, 1991.
  34. Large language models produce responses perceived to be empathic. 2024. URL https://arxiv.org/pdf/2403.18148v1.pdf.
  35. HaluEval: A large-scale hallucination evaluation benchmark for large language models. In Houda Bouamor, Juan Pino, and Kalika Bali (eds.), Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pp.  6449–6464, Singapore, December 2023. Association for Computational Linguistics. doi: 10.18653/v1/2023.emnlp-main.397. URL https://aclanthology.org/2023.emnlp-main.397.
  36. Chin-Yew Lin. ROUGE: A package for automatic evaluation of summaries. In Text Summarization Branches Out, pp.  74–81, Barcelona, Spain, July 2004. Association for Computational Linguistics. URL https://aclanthology.org/W04-1013.
  37. LLM-eval: Unified multi-dimensional automatic evaluation for open-domain conversations with large language models. In Yun-Nung Chen and Abhinav Rastogi (eds.), Proceedings of the 5th Workshop on NLP for Conversational AI (NLP4ConvAI 2023), pp.  47–58, Toronto, Canada, July 2023. Association for Computational Linguistics. doi: 10.18653/v1/2023.nlp4convai-1.5. URL https://aclanthology.org/2023.nlp4convai-1.5.
  38. Towards emotional support dialog systems. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp.  3469–3483, Online, August 2021. Association for Computational Linguistics. doi: 10.18653/v1/2021.acl-long.269. URL https://aclanthology.org/2021.acl-long.269.
  39. G-eval: NLG evaluation using gpt-4 with better human alignment. In Houda Bouamor, Juan Pino, and Kalika Bali (eds.), Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pp.  2511–2522, Singapore, December 2023. Association for Computational Linguistics. doi: 10.18653/v1/2023.emnlp-main.153. URL https://aclanthology.org/2023.emnlp-main.153.
  40. Self-refine: Iterative refinement with self-feedback. Advances in Neural Information Processing Systems, 36, 2024.
  41. Training models to generate, recognize, and reframe unhelpful thoughts. In Anna Rogers, Jordan Boyd-Graber, and Naoaki Okazaki (eds.), Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.  13641–13660, Toronto, Canada, July 2023. Association for Computational Linguistics. doi: 10.18653/v1/2023.acl-long.763. URL https://aclanthology.org/2023.acl-long.763.
  42. Kateri McRae. Cognitive emotion regulation: a review of theory and scientific findings. Current Opinion in Behavioral Sciences, 10:119–124, 2016. URL https://api.semanticscholar.org/CorpusID:20887028.
  43. Feeling shame and guilt when observing workplace incivility: Elicitors and behavioral responses. Human Resource Development Quarterly, 31(4):371–392, 2020.
  44. Kanishka Misra. minicons: Enabling flexible behavioral and representational analyses of transformer language models. arXiv preprint arXiv:2203.13112, 2022.
  45. Rethinking feelings: an fmri study of the cognitive regulation of emotion. Journal of cognitive neuroscience, 14(8):1215–1229, 2002.
  46. Trends in psychological distress and outpatient mental health care of adults during the covid-19 era. Annals of Internal Medicine, 2024.
  47. Affective cognition: Exploring lay theories of emotion. Cognition, 143:141–162, 2015. ISSN 0010-0277. doi: https://doi.org/10.1016/j.cognition.2015.06.010. URL https://www.sciencedirect.com/science/article/pii/S0010027715300196.
  48. Modeling emotion in complex stories: The stanford emotional narratives dataset. IEEE Transactions on Affective Computing, 12:579–594, 2019.
  49. OpenAI. Gpt-4 technical report, 2023.
  50. The cognitive structure of emotions. Cambridge university press, 2022.
  51. Bleu: a method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pp.  311–318, Philadelphia, Pennsylvania, USA, July 2002. Association for Computational Linguistics. doi: 10.3115/1073083.1073135. URL https://aclanthology.org/P02-1040.
  52. Reinhard Pekrun. The control-value theory of achievement emotions: Assumptions, corollaries, and implications for educational research and practice. Educational psychology review, 18:315–341, 2006.
  53. Control globally, understand locally: A global-to-local hierarchical graph network for emotional support conversation. In International Joint Conference on Artificial Intelligence, 2022. URL https://api.semanticscholar.org/CorpusID:248406141.
  54. Justus J Randolph. Free-marginal multirater kappa (multirater k [free]): An alternative to fleiss’ fixed-marginal multirater kappa. Online submission, 2005.
  55. Cognitive reappraisal of negative affect: converging evidence from emg and self-report. Emotion, 10(4):587, 2010.
  56. An investigation of dimensions of cognitive appraisal in emotion using the repertory grid technique. Motivation and emotion, 14(1):1–26, 1990.
  57. Appraisal theory. Appraisal processes in emotion: Theory, methods, research, pp.  3–19, 2001.
  58. A comparison of two types of social support for mothers of mentally ill children. Journal of Child and Adolescent Psychiatric Nursing, 22(2):86–98, 2009.
  59. Appraisal theory: Overview, assumptions, varieties, 2001.
  60. Human–ai collaboration enables more empathic conversations in text-based peer-to-peer mental health support. Nature Machine Intelligence, 5(1):46–57, 2023a.
  61. Cognitive reframing of negative thoughts through human-language model interaction. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.  9977–10000, Toronto, Canada, July 2023b. Association for Computational Linguistics. doi: 10.18653/v1/2023.acl-long.555. URL https://aclanthology.org/2023.acl-long.555.
  62. A long way to go: Investigating length correlations in rlhf, 2023.
  63. Patterns of cognitive appraisal in emotion. Journal of personality and social psychology, 48(4):813, 1985.
  64. Appraisal components, core relational themes, and the emotions. Cognition & emotion, 7(3-4):233–269, 1993.
  65. Emotion analysis and detection during COVID-19. In Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, and Stelios Piperidis (eds.), Proceedings of the Thirteenth Language Resources and Evaluation Conference, pp.  6938–6947, Marseille, France, June 2022. European Language Resources Association. URL https://aclanthology.org/2022.lrec-1.750.
  66. Llama 2: Open foundation and fine-tuned chat models. ArXiv, abs/2307.09288, 2023. URL https://api.semanticscholar.org/CorpusID:259950998.
  67. MISC: A mixed strategy-aware model integrating COMET for emotional support conversation. In Smaranda Muresan, Preslav Nakov, and Aline Villavicencio (eds.), Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.  308–319, Dublin, Ireland, May 2022. Association for Computational Linguistics. doi: 10.18653/v1/2022.acl-long.25. URL https://aclanthology.org/2022.acl-long.25.
  68. Emotion regulation changes the duration of the bold response to emotional stimuli. Social Cognitive and Affective Neuroscience, 11(10):1550–1559, 2016.
  69. Chain of thought prompting elicits reasoning in large language models. In Alice H. Oh, Alekh Agarwal, Danielle Belgrave, and Kyunghyun Cho (eds.), Advances in Neural Information Processing Systems, 2022.
  70. Transformers: State-of-the-art natural language processing. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pp.  38–45, Online, October 2020. Association for Computational Linguistics. doi: 10.18653/v1/2020.emnlp-demos.6. URL https://www.aclweb.org/anthology/2020.emnlp-demos.6.
  71. Healme: Harnessing cognitive reframing in large language models for psychotherapy, 2024.
  72. Do large language models latently perform multi-hop reasoning?, 2024. URL https://arxiv.org/pdf/2402.16837.pdf.
  73. ReAct: Synergizing reasoning and acting in language models. In International Conference on Learning Representations (ICLR), 2023.
  74. The PEACE-reviews dataset: Modeling cognitive appraisals in emotion text analysis. In Houda Bouamor, Juan Pino, and Kalika Bali (eds.), Findings of the Association for Computational Linguistics: EMNLP 2023, pp.  2822–2840, Singapore, December 2023. Association for Computational Linguistics. URL https://aclanthology.org/2023.findings-emnlp.186.
  75. A meta-analytic review of the associations between cognitive appraisals and emotions in cognitive appraisal theory. PsyArXiv, 2023. URL https://psyarxiv.com/ystxc.
  76. Jamil Zaki. Empathy: a motivated account. Psychological Bulletin, 140(6):1608, 2014.
  77. Why do you feel this way? summarizing triggers of emotions in social media posts. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pp.  9436–9453, Abu Dhabi, United Arab Emirates, December 2022. Association for Computational Linguistics. URL https://aclanthology.org/2022.emnlp-main.642.
  78. Evaluating subjective cognitive appraisals of emotions from large language models. In Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 2023. Association for Computational Linguistics.
  79. Bertscore: Evaluating text generation with bert. In International Conference on Learning Representations, 2020. URL https://openreview.net/forum?id=SkeHuCVFDr.
  80. AugESC: Dialogue augmentation with large language models for emotional support conversation. In Anna Rogers, Jordan Boyd-Graber, and Naoaki Okazaki (eds.), Findings of the Association for Computational Linguistics: ACL 2023, pp.  1552–1568, Toronto, Canada, July 2023. Association for Computational Linguistics. doi: 10.18653/v1/2023.findings-acl.99. URL https://aclanthology.org/2023.findings-acl.99.
  81. Instruction-following evaluation for large language models. arXiv preprint arXiv:2311.07911, 2023a.
  82. Facilitating multi-turn emotional support conversation with positive emotion elicitation: A reinforcement learning approach. In Anna Rogers, Jordan Boyd-Graber, and Naoaki Okazaki (eds.), Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.  1714–1729, Toronto, Canada, July 2023b. Association for Computational Linguistics. doi: 10.18653/v1/2023.acl-long.96. URL https://aclanthology.org/2023.acl-long.96.
  83. Inducing positive perspectives with text reframing. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.  3682–3700, Dublin, Ireland, May 2022. Association for Computational Linguistics. doi: 10.18653/v1/2022.acl-long.257. URL https://aclanthology.org/2022.acl-long.257.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Hongli Zhan (6 papers)
  2. Allen Zheng (1 paper)
  3. Yoon Kyung Lee (9 papers)
  4. Jina Suh (29 papers)
  5. Junyi Jessy Li (79 papers)
  6. Desmond C. Ong (26 papers)
Citations (3)
Youtube Logo Streamline Icon: https://streamlinehq.com