Scaling behavior of SOP to significantly larger robot fleets
Determine whether the near-linear scaling in wall-clock training efficiency observed under the Scalable Online Post-training (SOP) framework when increasing the number of robot actors continues to hold for significantly larger robot fleets during online, distributed, multi-task post-training of generalist Vision-Language-Action policies in the physical world.
Sponsor
References
Whether near-linear scaling extends to significantly larger fleets, and how to support continual acquisition of new skills without catastrophic forgetting, are open questions.
— SOP: A Scalable Online Post-Training System for Vision-Language-Action Models
(2601.03044 - Pan et al., 6 Jan 2026) in Discussion and Future Work