Extent of LLM propositional reasoning-based imagery capacity beyond human working memory limits
Ascertain the extent of propositional reasoning-based mental imagery capacity in large language models when they are not constrained by human working memory limits. Doing so requires evaluating performance on instruction sets with substantially more than 3–5 steps and more than 4 imagined objects, and characterizing how accuracy scales with task complexity.
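One way the scaling analysis could be set up is sketched below. This is a minimal illustration, not the authors' method: the `Trial` record, the `accuracy_by_steps` helper, and the demo values are all hypothetical, and the records are made-up placeholders rather than results from the paper. The sketch assumes each trial has already been scored as correct or incorrect and simply reports mean accuracy per instruction count.

```python
from collections import defaultdict
from typing import Iterable, NamedTuple


class Trial(NamedTuple):
    """Hypothetical record of one scored imagery task."""
    n_steps: int     # number of instructions in the task
    n_objects: int   # number of imagined objects the task involves
    correct: bool    # whether the model's final answer was judged correct


def accuracy_by_steps(trials: Iterable[Trial]) -> dict[int, float]:
    """Return mean accuracy for each instruction count, ordered by count."""
    totals = defaultdict(lambda: [0, 0])  # n_steps -> [num_correct, num_trials]
    for t in trials:
        totals[t.n_steps][0] += int(t.correct)
        totals[t.n_steps][1] += 1
    return {k: c / n for k, (c, n) in sorted(totals.items())}


# Made-up placeholder records, NOT results from the paper.
demo = [
    Trial(4, 3, True), Trial(4, 3, True),
    Trial(8, 6, True), Trial(8, 6, False),
    Trial(16, 10, False), Trial(16, 10, False),
]
curve = accuracy_by_steps(demo)
for steps, acc in curve.items():
    print(f"{steps:>3} steps: accuracy {acc:.2f}")
```

Grouping by `n_objects` instead, or by the pair `(n_steps, n_objects)`, would follow the same pattern and would help separate the effect of instruction length from the effect of object count.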
References
Without this limitation, it is unknown the extent of LLMs propositional reasoning capacity.
— Artificial Phantasia: Evidence for Propositional Reasoning-Based Mental Imagery in Large Language Models
(2509.23108 - McCarty et al., 27 Sep 2025) in Section "Future Work"