Necessity of Broad and Rich Simulated Environments for Scaling Embodied Agents
Determine whether obtaining sufficient breadth and richness across many simulated environments and commercial video games is necessary to effectively scale embodied agents that connect language, perception, and action. This investigation should assess the necessity claim by evaluating agents trained on narrow versus broad and rich distributions of environments and tasks, and quantify impacts on generalization, robustness, and capability scaling.
References
Instead, SIMA makes progress towards embodied AI by leveraging many simulated environments and commercial video games to obtain the sufficient breadth and richness that we conjecture to be necessary for effectively scaling embodied agents---with the hope that lessons learned (and possibly even the agents themselves) will be applicable to robotic embodiments in the future.