Reliable real-world deployment of large-scale VLA policies
Establish reliable methods for deploying large-scale vision-language-action policies, including OpenVLA, pi_0, MolmoAct, and Gemini Robotics, in real-world robotic settings.
References
Large-scale vision-language-action policies such as OpenVLA, pi_0, MolmoAct and Gemini Robotics have demonstrated strong language-conditioned manipulation capabilities across diverse embodiments, yet reliably deploying them in real-world settings remains an open problem.
— TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics
(2602.19313 - Chen et al., 22 Feb 2026) in Related Work — The reward bottleneck for VLA.