Persistence of stochastic iterative computation benefits beyond behavior cloning
Investigate whether the performance benefits of combining stochasticity injection and supervised iterative computation persist outside behavior cloning, specifically in reinforcement learning fine-tuning, large-scale pretraining, and long-horizon planning settings.
References
Finally, our analysis focuses on behavior cloning. It remains an open question whether the benefits of the \componentref{comp:stoch}+\componentref{comp:sic} paradigm persist in other settings, such as RL-finetuning, large-scale pretraining, or long-horizon planning.
— Much Ado About Noising: Dispelling the Myths of Generative Robotic Control
(2512.01809 - Pan et al., 1 Dec 2025) in Discussion