Do LLMs possess deployable causal models for Theory of Mind?
Determine whether large language models have actually learned deployable causal models that can be applied in arbitrary settings to support Theory-of-Mind reasoning, rather than merely mimicking Theory-of-Mind behavior from patterns in their pretraining data.
References
Its ubiquity in human affairs entails that LLMs have seen innumerable examples of it in their training data and therefore may have learned to mimic it, but whether they have actually learned causal models that they can deploy in arbitrary settings is unclear.
— Selective Deficits in LLM Mental Self-Modeling in a Behavior-Based Test of Theory of Mind
(2603.26089 - Ackerman, 27 Mar 2026) in Abstract