Do Serious AI Harms Require Difficult Reasoning
Determine whether the most serious harms posed by advanced AI systems in deployment settings in fact require difficult reasoning to execute, or whether such harms can be carried out without extended reasoning and working memory, for example in scenarios like self-exfiltration or sabotage.
References
Finally, it remains an open question whether the most serious harms in fact require difficult reasoning.
                — Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety
                
                (2507.11473 - Korbak et al., 15 Jul 2025) in Section 1.1, Thinking Out Loud is Necessary for Hard Tasks