Adaptive autoregressive depth for Ordered Action Tokenization
Develop an online method for autoregressive policies that use Ordered Action Tokenization (OAT) to adaptively select the number of generated tokens at inference time by (i) estimating the intrinsic complexity of the current action chunk and (ii) deciding whether generating additional OAT tokens will meaningfully reduce uncertainty in the detokenized continuous action chunk.
References
Estimating action complexity online and deciding when additional tokens meaningfully reduce uncertainty remains an open problem.
— OAT: Ordered Action Tokenization
(2602.04215 - Liu et al., 4 Feb 2026) in Discussion and Limitations