Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
144 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Model-free stochastic linear quadratic design by semidefinite programming (2412.17230v1)

Published 23 Dec 2024 in math.OC

Abstract: In this article, we study a model-free design approach for stochastic linear quadratic (SLQ) controllers. Based on the convexity of the SLQ dual problem and the Karush-Kuhn-Tucker (KKT) conditions, we find the relationship between the optimal point of the dual problem and the Q-function, which can be used to develop a novel model-free semidefinite programming (SDP) algorithm for deriving optimal control gain. This study provides a new optimization perspective for understanding Q-learning algorithms and lays a theoretical foundation for effective reinforcement learning (RL) algorithms. Finally, the effectiveness of the proposed model-free SDP algorithm is demonstrated by two case simulations.

Summary

We haven't generated a summary for this paper yet.