Toward Mechanistic Explanation of Deductive Reasoning in Language Models (2510.09340v1)

Published 10 Oct 2025 in cs.AI and cs.CL

Abstract: Recent LLMs have demonstrated relevant capabilities in solving problems that require logical reasoning; however, the corresponding internal mechanisms remain largely unexplored. In this paper, we show that a small LLM can solve a deductive reasoning task by learning the underlying rules (rather than operating as a statistical learner). A low-level explanation of its internal representations and computational circuits is then provided. Our findings reveal that induction heads play a central role in the implementation of the rule completion and rule chaining steps involved in the logical inference required by the task.