Influence of Emotional Payload in Prompt Tone on LLM Behavior
Ascertain whether the emotional payload carried by polite or rude phrasing in prompts affects the behavior or accuracy of large language models such as ChatGPT-4o, beyond the models’ token-level processing of the text.
References
After all, the politeness phrase is just a string of words to the LLM, and we don't know if the emotional payload of the phrase matters to the LLM (Bos, 2024).
— Mind Your Tone: Investigating How Prompt Politeness Affects LLM Accuracy (short paper)
(2510.04950 - Dobariya et al., 6 Oct 2025) in Section 5, Discussion and conclusions