Fundamental sample-efficiency gap for robust control

Prove or refute the conjecture that robust control incurs a fundamental asymptotic sample-efficiency gap relative to certainty equivalence and domain randomization in learning the linear quadratic regulator, by developing lower bounds that preclude the 1/N trace rate achieved by CE and DR.

Background

The paper’s upper bounds show that RC’s leading term depends on dθ and the operator norm of H(θ⋆)FI(θ⋆)^{-1}, while CE and DR match the optimal 1/N trace rate. The authors conjecture the RC gap is fundamental, reflecting its conservative worst-case design.

A definitive lower bound would formalize RC’s intrinsic inefficiency relative to CE and DR and guide the choice of synthesis methods in low- versus high-data regimes.

References

We conjecture that this gap is fundamental, due to the conservative nature of robust control.

— Domain Randomization is Sample Efficient for Linear Quadratic Control (2502.12310 - Fujinami et al., 17 Feb 2025) in Subsection Contributions (Section 1) and Discussion (Theoretical extensions)

Fundamental sample-efficiency gap for robust control

Background

References

Related Problems