Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI (2404.15058v1)

Published 23 Apr 2024 in cs.CY and cs.AI

Abstract: Recent generative AI systems have demonstrated more advanced persuasive capabilities and are increasingly permeating areas of life where they can influence decision-making. Generative AI presents a new risk profile of persuasion due the opportunity for reciprocal exchange and prolonged interactions. This has led to growing concerns about harms from AI persuasion and how they can be mitigated, highlighting the need for a systematic study of AI persuasion. The current definitions of AI persuasion are unclear and related harms are insufficiently studied. Existing harm mitigation approaches prioritise harms from the outcome of persuasion over harms from the process of persuasion. In this paper, we lay the groundwork for the systematic study of AI persuasion. We first put forward definitions of persuasive generative AI. We distinguish between rationally persuasive generative AI, which relies on providing relevant facts, sound reasoning, or other forms of trustworthy evidence, and manipulative generative AI, which relies on taking advantage of cognitive biases and heuristics or misrepresenting information. We also put forward a map of harms from AI persuasion, including definitions and examples of economic, physical, environmental, psychological, sociocultural, political, privacy, and autonomy harm. We then introduce a map of mechanisms that contribute to harmful persuasion. Lastly, we provide an overview of approaches that can be used to mitigate against process harms of persuasion, including prompt engineering for manipulation classification and red teaming. Future work will operationalise these mitigations and study the interaction between different types of mechanisms of persuasion.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (20)
  1. Seliem El-Sayed (4 papers)
  2. Canfer Akbulut (7 papers)
  3. Amanda McCroskery (5 papers)
  4. Geoff Keeling (11 papers)
  5. Zachary Kenton (18 papers)
  6. Zaria Jalan (4 papers)
  7. Nahema Marchal (11 papers)
  8. Arianna Manzini (5 papers)
  9. Toby Shevlane (7 papers)
  10. Shannon Vallor (4 papers)
  11. Daniel Susser (1 paper)
  12. Matija Franklin (17 papers)
  13. Sophie Bridgers (4 papers)
  14. Harry Law (2 papers)
  15. Matthew Rahtz (11 papers)
  16. Murray Shanahan (46 papers)
  17. Michael Henry Tessler (13 papers)
  18. Arthur Douillard (20 papers)
  19. Tom Everitt (39 papers)
  20. Sasha Brown (5 papers)
Citations (11)