Discrete CoT step complexity for reachability with constant-depth transformers
Determine whether a constant-depth transformer using discrete chain-of-thought can solve directed graph reachability on a graph with n vertices with strictly fewer than O(n^2) discrete CoT decoding steps, and, if so, quantify the minimal number of steps required.
References
For a constant-depth transformer, \citet{merrill2023expressive} shows directed graph reachability can be solved with $O(n2)$ CoT steps where $n$ is the number of vertices, while it remains unclear whether a smaller number of discrete CoT steps can solve the task.
— Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought
(2505.12514 - Zhu et al., 18 May 2025) in Section 1.1 (Related works), paragraph "Reasoning as graph problems"