Online adaptation of direct data-driven LQR methods
Develop a direct data-driven methodology that adaptively updates the linear quadratic regulator (LQR) state-feedback gain from online closed-loop data, rather than relying on offline or episodic data batches, so as to enable real-time controller adaptation.
References
Direct data-driven design methods for the linear quadratic regulator (LQR) mainly use offline or episodic data batches, and their online adaptation has been acknowledged as an open problem.
— Data-Enabled Policy Optimization for Direct Adaptive Learning of the LQR
(2401.14871 - Zhao et al., 26 Jan 2024) in Abstract