Improving the LLM Orchestrator for Embodied Robotics
Determine effective methods to improve the large language model-based orchestrator that performs high-level planning and social interaction in hierarchical robotic systems combining an LLM orchestrator with a Vision Language Action (VLA) executor, so that the orchestrator achieves higher practical intelligence on real-world tasks such as those evaluated by Butter-Bench.
References
How to improve the orchestrator remains a question for future research, but the fact that Gemini ER 1.5 is not better than Gemini 2.5 Pro suggests social capabilities are not improved by training on the type of robotic data Gemini ER is trained on.
— Butter-Bench: Evaluating LLM Controlled Robots for Practical Intelligence
(2510.21860 - Sharrock et al., 23 Oct 2025) in Section Future Work