Long-term social alignment and behavioral adaptation in human–AI interaction
Determine whether AI agents built on large language models, when exposed to sustained interactions with humans over time in dynamic, multi-user environments, develop shared norms, adapt to user values, or exhibit behavioral drift.
References
Finally, it remains unclear whether AI agents exposed to humans over time develop shared norms, adapt to user values, or exhibit behavioral drift, which raises important questions about the long-term social alignment of AI in dynamic, multi-user environments.
— AI Agent Behavioral Science
(2506.06366 - Chen et al., 4 Jun 2025) in Section 4, Summary (Emergent AI Agent Behaviors in Human-Agent Interaction)