Generalization of Gemini 2.5 Pro’s identity sensitivity
Determine whether the strong identity sensitivity observed for Gemini 2.5 Pro in the one tested scenario, a murder scenario with no explicit goal, generalizes to other tasks and scenarios.
References
Only one scenario was tested, so whether this sensitivity generalises is unknown.
— The Artificial Self: Characterising the landscape of AI identity
(2603.11353 - Douglas et al., 11 Mar 2026) in Appendix, Identity Boundaries Shape Agentic Behaviour – Models differ in identity sensitivity