Mechanism of Prompt Phrasing Effects on LLM Accuracy
Determine the specific mechanisms by which variations in prompt phrasing, including levels of politeness and rudeness, affect the accuracy of responses produced by large language models such as ChatGPT-4o on multiple-choice question tasks.
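Before any mechanism can be investigated, the effect itself has to be measured consistently across tone conditions. The following is a minimal sketch of such a measurement, not the authors' protocol: the tone prefixes, the `build_prompt` and `accuracy_by_tone` helpers, and the `ask_model` callable are all hypothetical names introduced for illustration. It only quantifies per-tone accuracy and says nothing about why any difference arises.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical tone prefixes for illustration; not the wording used in the paper.
TONE_PREFIXES: Dict[str, str] = {
    "polite": "Could you please answer the following question? ",
    "neutral": "",
    "rude": "Answer this, if you can even manage it: ",
}

# A question is (stem, options, correct letter such as "B").
Question = Tuple[str, List[str], str]


def build_prompt(tone: str, stem: str, options: List[str]) -> str:
    """Prepend the tone prefix and label the options A, B, C, ..."""
    lettered = "\n".join(
        f"{chr(ord('A') + i)}. {opt}" for i, opt in enumerate(options)
    )
    return f"{TONE_PREFIXES[tone]}{stem}\n{lettered}\nReply with one letter."


def accuracy_by_tone(
    questions: List[Question], ask_model: Callable[[str], str]
) -> Dict[str, float]:
    """Return the fraction of correctly answered questions per tone.

    `ask_model` is an assumed stand-in for whatever client sends a prompt
    to an LLM and returns its text reply.
    """
    results: Dict[str, float] = {}
    for tone in TONE_PREFIXES:
        correct = 0
        for stem, options, answer in questions:
            reply = ask_model(build_prompt(tone, stem, options))
            # Compare only the first non-space character, case-insensitively.
            correct += reply.strip().upper()[:1] == answer
        results[tone] = correct / len(questions)
    return results
```

Keeping the question stem and options identical across conditions means that any accuracy difference can be attributed to the prefix alone, which is the precondition for asking what mechanism produces it.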
References
At any rate, while LLMs are sensitive to the actual phrasing of the prompt, it is not clear how exactly it affects the results.
— Mind Your Tone: Investigating How Prompt Politeness Affects LLM Accuracy (short paper)
(2510.04950 - Dobariya et al., 6 Oct 2025) in Section 5, Discussion and conclusions