Practical seriality of sequential decision problems in RL
Ascertain whether computing optimal policies in practical sequential decision problems, modeled as Markov Decision Processes, is inherently serial in the sense of lying outside the threshold-circuit class TC and therefore requiring serial computation that cannot be efficiently parallelized.
References
However, in practice, it remains unclear whether this problem is likely serial.
— The Serial Scaling Hypothesis
(2507.12549 - Liu et al., 16 Jul 2025) in Section 4.4, Sequential Decision Problems