Ordering of burn-in times for domain randomization versus robust control and certainty equivalence
Prove that, in the linear quadratic regulator learning setting analyzed in this paper, the burn-in time (the minimum number of experiments required for the sample-efficiency bounds to hold) for the domain randomization controller lies between the burn-in times of robust control and certainty equivalence when domain randomization uses the data-informed sampling distributions defined in this work.
References
We further conjecture that the burn-in time for DR lies between that of RC and CE. We leave proving this to future work, but verify this conjecture numerically.
— Domain Randomization is Sample Efficient for Linear Quadratic Control
(2502.12310 - Fujinami et al., 17 Feb 2025) in Subsection Contributions (Section 1)