"It's a Fair Game", or Is It? Examining How Users Navigate Disclosure Risks and Benefits When Using LLM-Based Conversational Agents (2309.11653v2)

Published 20 Sep 2023 in cs.HC, cs.AI, and cs.CR

Abstract: The widespread use of LLM-based conversational agents (CAs), especially in high-stakes domains, raises many privacy concerns. Building ethical LLM-based CAs that respect user privacy requires an in-depth understanding of the privacy risks that concern users the most. However, existing research, primarily model-centered, does not provide insight into users' perspectives. To bridge this gap, we analyzed sensitive disclosures in real-world ChatGPT conversations and conducted semi-structured interviews with 19 LLM-based CA users. We found that users are constantly faced with trade-offs between privacy, utility, and convenience when using LLM-based CAs. However, users' erroneous mental models and the dark patterns in system design limited their awareness and comprehension of the privacy risks. Additionally, the human-like interactions encouraged more sensitive disclosures, which complicated users' ability to navigate the trade-offs. We discuss practical design guidelines and the needs for paradigm shifts to protect the privacy of LLM-based CA users.

Authors (8)
  1. Zhiping Zhang (9 papers)
  2. Michelle Jia (1 paper)
  3. Bingsheng Yao (49 papers)
  4. Sauvik Das (13 papers)
  5. Ada Lerner (4 papers)
  6. Dakuo Wang (87 papers)
  7. Tianshi Li (22 papers)
  8. Hao-Ping Lee (3 papers)
Citations (35)

Summary

Examining Disclosure Risks in LLM-Based Conversational Agents

The rapid adoption of LLM-based conversational agents (CAs), such as ChatGPT, in sensitive domains like healthcare and finance presents significant privacy challenges. The paper "'It's a Fair Game', or Is It? Examining How Users Navigate Disclosure Risks and Benefits When Using LLM-Based Conversational Agents" by Zhang et al. offers a comprehensive examination of how users interact with these systems, revealing the nuanced balance users must strike between utility, convenience, and privacy.

User Disclosure Patterns and Privacy Trade-offs

The paper combines an analysis of sensitive disclosures in real-world ChatGPT conversations with semi-structured interviews of 19 LLM-based CA users. The findings indicate that users struggle with the trade-offs between the benefits of CA usage and the associated privacy risks. Users' mental models often lead them to underestimate these risks, partly because of erroneous assumptions about how the systems work. The research also found that users perceive varying levels of data sensitivity and adopt ad-hoc protective measures, such as falsifying details or providing only general information; however, these actions are sporadic, revealing the lack of a coherent privacy strategy.

Misalignment of User Expectations and System Transparency

A significant portion of the research explores users' mental models of LLM-based systems, which shape how they perceive and manage privacy risks. The authors found that users often hold flawed mental models of how these systems use input data for response generation and training. This misalignment hinders users' ability to navigate privacy risks effectively and points to a crucial gap between system transparency and user expectations. Such misunderstandings can lead users to disclose more information than they would if they were adequately informed.

Institutional and Interdependent Privacy Concerns

The research highlights two main privacy risks: institutional (e.g., data misuse by companies) and interdependent (e.g., sharing data about others). The interdependent risk is particularly complex because users often disclose information about third parties. Interestingly, these disclosures occur in contexts where users might not fully appreciate the ramifications, underscoring the necessity for CAs to better manage and contextualize information sharing.

System Design Recommendations

Based on the insights gathered, the authors emphasize the importance of privacy-aware system design. They advocate for more granular opt-out controls and context-sensitive privacy features that help users make more informed decisions about data sharing. Such enhancements should be paired with better user education on how LLMs function, so that over- and under-sharing driven by inaccurate mental models can be mitigated.
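
To make that recommendation concrete, below is a minimal illustrative sketch, not taken from the paper, of a client-side, context-sensitive disclosure check that could sit between a user's draft message and an LLM-based CA. It assumes a simple regex-based detector written in Python; the function names (flag_sensitive_spans, confirm_before_send) and the pattern set are hypothetical, and a production feature would need richer detection (for example, recognizing third parties' names to cover interdependent disclosures) and a real interface rather than a console prompt.

import re

# Hypothetical sketch of a pre-submission disclosure check for an LLM-based CA.
# Illustrative only: patterns and names are assumptions, not the paper's design.

PATTERNS = {
    "email": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.]+\b"),
    "phone": re.compile(r"\b(?:\+?\d{1,3}[ -]?)?\(?\d{3}\)?[ -]?\d{3}[ -]?\d{4}\b"),
    "ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}

def flag_sensitive_spans(draft: str) -> list[tuple[str, str]]:
    """Return (category, matched_text) pairs found in the draft prompt."""
    hits = []
    for category, pattern in PATTERNS.items():
        hits.extend((category, m.group()) for m in pattern.finditer(draft))
    return hits

def confirm_before_send(draft: str) -> bool:
    """Warn the user and ask for confirmation when potentially sensitive data is found."""
    hits = flag_sensitive_spans(draft)
    if not hits:
        return True
    print("This message appears to contain:")
    for category, text in hits:
        print(f"  - {category}: {text}")
    return input("Send anyway? [y/N] ").strip().lower() == "y"

if __name__ == "__main__":
    example = "My SSN is 123-45-6789, can you draft a complaint letter for me?"
    if confirm_before_send(example):
        print("(would send the message to the CA here)")
    else:
        print("(message withheld)")

The point of the sketch is the interaction pattern, surfacing potentially sensitive content to the user before it leaves the client, rather than the specific detection logic, which aligns with the paper's call for context-sensitive features that inform disclosure decisions at the moment they are made.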

Conclusion and Future Directions

The paper provides essential groundwork for understanding the privacy dynamics faced by users of LLM-based CAs. While acknowledging the surveillance-centric architecture underlying today's LLM ecosystem, the authors call for a paradigm shift toward building these systems on privacy-by-design principles. They suggest that future research further explore user mental models and develop user-centric privacy-preserving technologies. Engaging with these unresolved issues will be critical to crafting systems that respect user privacy without compromising the functionality these technologies offer.
