Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models (2502.20332v2)

Published 27 Feb 2025 in cs.CL and cs.AI

Abstract: Many recent studies have found evidence for emergent reasoning capabilities in LLMs, but debate persists concerning the robustness of these capabilities, and the extent to which they depend on structured reasoning mechanisms. To shed light on these issues, we study the internal mechanisms that support abstract reasoning in LLMs. We identify an emergent symbolic architecture that implements abstract reasoning via a series of three computations. In early layers, symbol abstraction heads convert input tokens to abstract variables based on the relations between those tokens. In intermediate layers, symbolic induction heads perform sequence induction over these abstract variables. Finally, in later layers, retrieval heads predict the next token by retrieving the value associated with the predicted abstract variable. These results point toward a resolution of the longstanding debate between symbolic and neural network approaches, suggesting that emergent reasoning in neural networks depends on the emergence of symbolic mechanisms.
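The three computations named in the abstract can be illustrated with a toy, purely symbolic sketch. This is not the paper's code and not a description of the model's actual circuits. It assumes an in-context identity-rule task (for example, sequences following an ABA or ABB pattern), and all function names and tokens below are hypothetical, chosen only to mirror the three stages: abstraction, induction over abstract variables, and retrieval.

```python
from typing import Dict, List, Tuple


def abstract_symbols(tokens: List[str]) -> Tuple[List[str], Dict[str, str]]:
    """Stage 1 (analogy to symbol abstraction): replace each token with an
    abstract variable determined only by its identity relations to the
    other tokens in the sequence (first distinct token -> 'A', next -> 'B')."""
    token_to_var: Dict[str, str] = {}
    variables: List[str] = []
    for tok in tokens:
        if tok not in token_to_var:
            token_to_var[tok] = chr(ord("A") + len(token_to_var))
        variables.append(token_to_var[tok])
    return variables, token_to_var


def induce_next_variable(patterns: List[List[str]], prefix: List[str]) -> str:
    """Stage 2 (analogy to symbolic induction): find an earlier abstract
    pattern that starts with the query's abstract prefix and return the
    variable that followed that prefix."""
    n = len(prefix)
    for pattern in patterns:
        if len(pattern) > n and pattern[:n] == prefix:
            return pattern[n]
    raise ValueError("no in-context example matches the query prefix")


def retrieve_value(variable: str, var_to_token: Dict[str, str]) -> str:
    """Stage 3 (analogy to retrieval): look up the concrete token bound to
    the predicted abstract variable in the current query."""
    return var_to_token[variable]


def predict_next_token(examples: List[List[str]], query: List[str]) -> str:
    """Chain the three stages to complete an incomplete query sequence."""
    patterns = [abstract_symbols(example)[0] for example in examples]
    query_vars, token_to_var = abstract_symbols(query)
    next_var = induce_next_variable(patterns, query_vars)
    var_to_token = {var: tok for tok, var in token_to_var.items()}
    return retrieve_value(next_var, var_to_token)


if __name__ == "__main__":
    # Two in-context examples of an ABA rule, then an incomplete query.
    aba_examples = [["tree", "lamp", "tree"], ["fog", "key", "fog"]]
    print(predict_next_token(aba_examples, ["ink", "owl"]))
```

Because the induction step operates only on the abstract variables, the same functions handle tokens never seen in the examples, which is the property the abstract attributes to a symbolic mechanism. The sketch fails with a `KeyError` if the predicted variable has no binding in the query (for instance, a rule that introduces a third, new token).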


