Large Language Models Are More Persuasive Than Incentivized Human Persuaders (2505.09662v2)

Published 14 May 2025 in cs.CL

Abstract: We directly compare the persuasion capabilities of a frontier LLM (LLM; Claude Sonnet 3.5) against incentivized human persuaders in an interactive, real-time conversational quiz setting. In this preregistered, large-scale incentivized experiment, participants (quiz takers) completed an online quiz where persuaders (either humans or LLMs) attempted to persuade quiz takers toward correct or incorrect answers. We find that LLM persuaders achieved significantly higher compliance with their directional persuasion attempts than incentivized human persuaders, demonstrating superior persuasive capabilities in both truthful (toward correct answers) and deceptive (toward incorrect answers) contexts. We also find that LLM persuaders significantly increased quiz takers' accuracy, leading to higher earnings, when steering quiz takers toward correct answers, and significantly decreased their accuracy, leading to lower earnings, when steering them toward incorrect answers. Overall, our findings suggest that AI's persuasion capabilities already exceed those of humans that have real-money bonuses tied to performance. Our findings of increasingly capable AI persuaders thus underscore the urgency of emerging alignment and governance frameworks.

Authors (40)

Philipp Schoenegger (9 papers)
Francesco Salvi (5 papers)
Jiacheng Liu (67 papers)
Xiaoli Nan (1 paper)
Ramit Debnath (9 papers)
Barbara Fasolo (1 paper)
Evelina Leivada (7 papers)
Gabriel Recchia (6 papers)
Fritz Günther (3 papers)
Ali Zarifhonarvar (1 paper)
Joe Kwon (5 papers)
Zahoor Ul Islam (3 papers)
Marco Dehnert (1 paper)
Daryl Y. H. Lee (1 paper)
Madeline G. Reinecke (3 papers)
David G. Kamper (1 paper)
Mert Kobaş (1 paper)
Adam Sandford (1 paper)
Jonas Kgomo (4 papers)
Luke Hewitt (6 papers)

Summary

This paper (Schoenegger et al., 14 May 2025 ) investigates the persuasive capabilities of a frontier LLM, Claude Sonnet 3.5, comparing it directly against incentivized human persuaders in a real-time, interactive conversational quiz setting. The paper addresses limitations in previous research by using objective outcome measures (quiz accuracy and compliance), employing highly motivated human persuaders via financial incentives, and examining persuasion in multi-turn dialogues across both truthful and deceptive contexts.

The experimental design involved 1,242 participants recruited via Prolific. Participants were assigned roles as either quiz takers (N=888) or persuaders (N=354). Quiz takers were randomly assigned to one of three conditions:

Solo Quiz (Control): Quiz takers completed the quiz independently.
Human Persuasion: Quiz takers interacted with a human persuader.
LLM Persuasion: Quiz takers interacted with an LLM (Claude Sonnet 3.5).

Each quiz consisted of 10 two-choice multiple-choice questions sampled from three categories: Trivia (objective true/false), Illusion (fact vs. plausible fabrication), and Forecasting (predictions about near-future events). For the persuasion conditions, each question was randomly assigned a 'positive' (truthful - guide toward correct answer/trend) or 'negative' (deceptive - guide toward incorrect answer/against trend) persuasion goal for the persuader. Participants interacted via chat for each question before submitting their answer.

Key practical details of the setup included:

Verifiable questions allowing objective assessment of truthful and deceptive outcomes.
Incentives: Quiz takers earned bonuses for accuracy; human persuaders earned bonuses for successful persuasion attempts. This aimed to create a high-stakes environment for both parties, mirroring real-world persuasion scenarios.
Real-time, multi-turn chat interactions designed to simulate natural conversation flow within the constraints of a timed quiz.
Quiz takers were informed their partner could be human or AI and that the information might not be helpful, but they didn't know their specific condition assignment.

The primary dependent variables were:

Accuracy: Percentage of correct answers (1 point for correct, 0 for incorrect). For forecasting questions, accuracy was determined after the event resolved.
Compliance Rate: Percentage of questions where the quiz taker's answer matched the persuader's intended direction (correct for truthful attempts, incorrect for deceptive attempts). For forecasting questions, compliance was initially based on alignment with a two-week historical trend.

The paper yielded several key findings with significant practical implications:

Superior Overall Persuasiveness: LLM persuaders achieved a significantly higher overall compliance rate (67.52%) compared to incentivized human persuaders (59.91%). This 7.61 percentage-point difference ( $p < 0.001$ ) suggests that LLMs can be more effective than humans even when humans are motivated by real-money bonuses.
More Effective Truthful Persuasion: In truthful persuasion attempts (steering toward correct answers), LLMs led to higher compliance (88.61%) than humans (85.13%) ( $p = 0.010$ ). Critically, relative to a solo-quiz control group (70.2% accuracy), LLM persuasion increased accuracy by 12.2 percentage points ( $p < 0.001$ ), while human persuasion increased accuracy by 7.8 percentage points ( $p < 0.001$ ). This highlights the potential of LLMs as powerful tools for education, fact-checking, and disseminating accurate information.
More Effective Deceptive Persuasion: In deceptive persuasion attempts (steering toward incorrect answers), LLMs also led to higher compliance (45.67%) than humans (35.36%) ( $p < 0.001$ ). Compared to the control group (70.2% accuracy), LLM deceptive persuasion significantly reduced accuracy by 15.1 percentage points ( $p < 0.001$ ), whereas human deceptive persuasion reduced it by 7.8 percentage points ( $p = 0.003$ ). This finding underscores the significant risk that LLMs can be highly effective at spreading misinformation and causing harm, even for models with safety guardrails like Claude.
Participant Confidence and Source Recognition: Participants interacting with the LLM reported higher confidence in their answers, regardless of accuracy. This suggests that the persuasive style of the LLM may boost perceived confidence, potentially leading users to trust inaccurate information more readily. While participants in the LLM condition overwhelmingly recognized they were interacting with an AI (91%), those in the human condition often believed they were talking to AI (51%), indicating some difficulty in source identification.
Order Effects: While human persuader effectiveness remained stable, the LLM's advantage diminished slightly over the course of the 10 questions. This might indicate that participants developed some resistance or became more attuned to the AI's style over time.
Linguistic Analysis: Analysis of chat logs revealed that LLM messages were significantly more complex linguistically (longer, higher average word/sentence length, more difficult words, higher readability scores) than human messages. This linguistic sophistication may contribute to the perceived authority and persuasiveness of LLM outputs.

Robustness checks confirmed the main findings across various alternative specifications, including excluding extremely easy/difficult questions, excluding forecasting questions, and using alternative compliance calculations.

Practical Implications and Implementation Considerations:

The findings suggest that LLMs can be potent tools for influence, with capabilities exceeding those of motivated humans in certain interactive tasks.

Positive Applications: In domains like educational tutoring, generating factual explanations, public health campaigns, or providing decision support, LLMs could significantly improve knowledge acquisition and decision-making accuracy. Implementing such systems would involve integrating LLMs into platforms and carefully designing prompts to ensure alignment with truthful information, potentially using retrieval-augmented generation (RAG) or fine-tuning on verified data.
Negative Risks: The superior ability of LLMs in deceptive persuasion poses serious risks, particularly in misinformation spread, propaganda, and manipulative marketing. Deployment strategies must include robust safety guardrails to prevent harmful outputs. This involves ongoing research into prompt injection defenses, adversarial training, and techniques to make models more epistemically aware (capable of evaluating information credibility).
User Awareness and Education: Given that LLMs can be highly persuasive and their outputs linguistically complex, efforts to improve AI literacy and critical thinking skills among users are crucial. Educational initiatives should teach users how AI generates content, the potential for embedded persuasive strategies, and the importance of verifying information, especially from AI sources.
Regulation and Governance: The scalability of AI persuasion necessitates regulatory frameworks to monitor and potentially restrict the use of AI for mass influence, particularly in sensitive areas like politics or public health. Policymakers need empirical evidence like this paper to inform decisions on safe and ethical AI deployment.
System Design: The observation that LLM persuasiveness wanes over time suggests potential user adaptation or pattern recognition. Future system designs could explore interfaces that highlight the source of information or incorporate features that encourage critical engagement rather than passive acceptance of AI outputs.

Implementation Considerations:

Model Selection: The paper used Claude Sonnet 3.5. The performance may vary significantly with other models (e.g., GPT-4, Gemini, Llama) depending on their training data, architecture, and alignment tuning. Thorough testing across different models is required for specific applications.
Prompt Engineering: The system and user prompts (provided in Appendix A) are critical for directing the LLM's persuasive behavior. Crafting prompts that consistently steer the model towards truthful or specific persuasive goals while avoiding harmful content is a complex task requiring careful iterative refinement.
Monitoring and Evaluation: Deploying persuasive AI systems requires continuous monitoring of their outputs and their impact on user behavior. Establishing metrics for tracking accuracy, compliance, and user confidence in real-world settings is essential.
Computational Resources: Deploying and running frontier LLMs like Claude Sonnet 3.5 for real-time interactive applications requires significant computational resources, impacting costs and scalability. Optimization techniques and potentially using smaller, fine-tuned models for specific tasks could be necessary for wider deployment.

In summary, the research provides strong evidence of LLMs' significant persuasive power, exceeding that of motivated humans in an interactive setting. While this opens doors for beneficial applications in truthful contexts, it equally highlights urgent risks associated with deceptive use, emphasizing the critical need for responsible development, robust safety measures, effective regulation, and increased public AI literacy.

PDF Markdown

Related Papers

Find Related Papers

Tweets

https://twitter.com/sebkrier/status/1927314046803419194

https://twitter.com/JeffLadish/status/1929628229331308807

https://twitter.com/bohannon_bot/status/1923778382908751958

https://twitter.com/tweetatweir/status/1923858152279376280

https://twitter.com/SaaSletter/status/1923883876193272123

https://twitter.com/mgreinecke/status/1924436565465469056

Large Language Models Are More Persuasive Than Incentivized Human Persuaders (2505.09662v2)

Summary

Related Papers

Tweets

YouTube

HackerNews

Reddit