Scaling laws for 3D VLM pretraining and downstream robot control
Investigate and characterize the scaling laws that relate SPEAR-1’s downstream robot control performance to the quantity and quality of the 3D visual question answering pretraining data used to train SPEAR-VLM, in order to understand how data scale and data quality affect task performance.
Sponsor
References
While we have showed the benefits of 3D VLM pre-training on downstream robot control tasks, the scaling laws relating the latter to the quantity and quality of 3D pre-training data are still not well understood.
— SPEAR-1: Scaling Beyond Robot Demonstrations via 3D Understanding
(2511.17411 - Nikolov et al., 21 Nov 2025) in Section: Discussion and Limitations