Systematic emergence of recovery behaviors in VLA models
Determine whether recovery behaviors systematically emerge in most deployment settings for generalist Vision-Language-Action (VLA) robot control models, and establish a rigorous evaluation procedure—such as plotting test-time scaling curves of recovery maneuvers—to assess the presence and consistency of this emergence across tasks and environments.
References
Finally, while prior results do show some examples of recovery behaviors in VLA models, it is unclear if such behaviors systematically emerge in most settings or not, and studying this aspect rigorously (for example, by plotting test-time scaling curves analogous to Figure~\ref{fig:lid_retries_vs_success}) is also useful for the community.
                — RaC: Robot Learning for Long-Horizon Tasks by Scaling Recovery and Correction
                
                (2509.07953 - Hu et al., 9 Sep 2025) in Section 6: Discussion, Conclusion, and Future Work (Future work paragraph)