Bail-Method Sensitivity Mechanism
Determine whether differences between system-prompt versus user-prompt instructions or other factors explain why different bail methods (bail tool, bail string, and bail prompt) lead models to bail on different subsets of prompts, and characterize the mechanism responsible for this method sensitivity.
References
So we consider this still unresolved. This sensitivity doesn't matter for any of our results, but it is an important open question that deserves further investigation.
— The LLM Has Left The Chat: Evidence of Bail Preferences in Large Language Models
(2509.04781 - Ensign et al., 5 Sep 2025) in Section 5.1: Method Sensitivity