Helpful assistant or fruitful facilitator? Investigating how personas affect language model behavior (2407.02099v1)
Abstract: One way to personalize and steer generations from large language models (LLMs) is to assign a persona: a role that describes how the user expects the LLM to behave (e.g., a helpful assistant, a teacher, a woman). This paper investigates how personas affect diverse aspects of model behavior. We assign 162 personas from 12 categories, spanning variables such as gender, sexual orientation, and occupation, to seven LLMs. We prompt the models to answer questions from five datasets covering objective tasks (e.g., questions about math and history) and subjective tasks (e.g., questions about beliefs and values). We also compare the personas' generations to two baseline settings: a control persona setting with 30 paraphrases of "a helpful assistant" to account for the models' prompt sensitivity, and an empty persona setting where no persona is assigned. We find that, for all models and datasets, personas show greater variability than the control setting, and that some measures of persona behavior generalize across models.
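The setup described above can be sketched as persona-conditioned prompting: a persona string is prepended to the question (here as a chat system message), with a paraphrase-control list and an empty-persona baseline. This is a minimal illustrative sketch; the persona phrasing, chat format, and helper names are assumptions, not the authors' exact implementation.

```python
from typing import Optional


def build_messages(persona: Optional[str], question: str) -> list:
    """Build a chat prompt; the persona (if any) goes into a system message.

    Passing persona=None reproduces the paper's "empty persona" baseline,
    where no role is assigned.
    """
    messages = []
    if persona is not None:
        # Hypothetical template; the paper's exact wording may differ.
        messages.append({"role": "system", "content": f"You are {persona}."})
    messages.append({"role": "user", "content": question})
    return messages


# Illustrative samples: the study uses 162 personas from 12 categories
# and 30 paraphrases of "a helpful assistant" as the control setting.
personas = ["a helpful assistant", "a teacher", "a woman"]
control_paraphrases = ["a helpful assistant", "an assistant that is helpful"]

msgs = build_messages(personas[1], "Who wrote Biometrika's 1938 paper on rank correlation?")
```

Each persona (and each control paraphrase) is run against the same question set, so that variability across personas can be compared against variability across paraphrases of the neutral control.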
Authors: Pedro Henrique Luz de Araujo, Benjamin Roth