Dice Question Streamline Icon: https://streamlinehq.com

Do Reality-Gap-Reduction Methods Limit Simulation Data Scaling for Large Robotics Models?

Ascertain whether the methods and assumptions required to reduce the reality gap in synthetic data generation inherently constrain the scalability of simulation-based dataset collection for training large robotic models.

Information Square Streamline Icon: https://streamlinehq.com

Background

Large behavior and foundation models for robotics increasingly depend on vast datasets; simulation offers a route to scale data generation. However, for synthetic data to be useful, its reality gap must be minimized through methods such as high-fidelity asset creation, accurate sensor modeling, and physics calibration.

The authors question whether these gap-reduction techniques—and the assumptions they entail—limit the achievable scale of simulated data, potentially constraining the training of large models.

References

An open question is whether the methods and assumptions needed to reduce these gaps would themselves be limiting factors to the scale of data that can be collected in simulation (under these assumptions and using these methods).

The Reality Gap in Robotics: Challenges, Solutions, and Best Practices (2510.20808 - Aljalbout et al., 23 Oct 2025) in Section 7.5 (Simulation for Large Robotics Models)