
Method of Successive Approximations for Stochastic Optimal Control: Contractivity and Convergence

Published 11 May 2024 in math.OC, cs.NA, math.DS, math.NA, and math.PR | (2405.07048v1)

Abstract: The Method of Successive Approximations (MSA) is a fixed-point iterative method for solving stochastic optimal control problems. It is an indirect method based on the optimality conditions of the Stochastic Maximum Principle (SMP), an extension of the Pontryagin Maximum Principle (PMP) to stochastic control problems. In this study, we investigate the contractivity and convergence of MSA for a specific class of stochastic dynamical systems: those whose drift coefficient is one-sided Lipschitz with a negative constant and whose diffusion coefficient is Lipschitz continuous. Our analysis proceeds in three steps. First, we prove the stability of the state process with respect to the control process. Second, we establish the stability of the adjoint process. Finally, we prove the contractivity, and hence the convergence, of MSA. This study contributes to the understanding of MSA's applicability and effectiveness for stochastic optimal control problems.
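
For orientation, the iteration described in the abstract can be sketched in a standard textbook form. The notation below (drift b, diffusion σ, running cost f, terminal cost g, Hamiltonian H, adjoint pair (p, q), constants μ and L) is assumed for illustration and is not taken from the paper. Sign conventions differ between references, and the paper's exact assumptions may be stated differently.

```latex
% Assumed setting (minimization convention); all symbols are illustrative.
\begin{align*}
  % Controlled state equation and cost functional
  dX_t &= b(X_t, u_t)\,dt + \sigma(X_t, u_t)\,dW_t, \qquad X_0 = x_0, \\
  J(u) &= \mathbb{E}\!\left[\int_0^T f(X_t, u_t)\,dt + g(X_T)\right], \\
  % Hamiltonian and adjoint backward SDE from the SMP
  H(x, u, p, q) &= \langle b(x, u), p\rangle
      + \operatorname{tr}\!\big(\sigma(x, u)^{\top} q\big) + f(x, u), \\
  dp_t &= -\partial_x H(X_t, u_t, p_t, q_t)\,dt + q_t\,dW_t,
      \qquad p_T = \partial_x g(X_T).
\end{align*}

% One MSA step: given u^k, solve the state equation forward for X^k,
% solve the adjoint equation backward for (p^k, q^k), then update
\[
  u^{k+1}_t \in \operatorname*{arg\,min}_{u}\; H\big(X^k_t, u, p^k_t, q^k_t\big).
\]

% One common way to write the structural assumptions named in the abstract
% (with \mu > 0 and L \ge 0; the norm on \sigma is left unspecified here):
\begin{align*}
  \langle x - y,\; b(x, u) - b(y, u)\rangle &\le -\mu\,|x - y|^2, \\
  \|\sigma(x, u) - \sigma(y, u)\| &\le L\,|x - y|.
\end{align*}
```

In this reading, MSA is the fixed-point iteration of the map taking u^k to u^{k+1}, and contractivity of that map in a suitable norm is what would yield convergence to a control satisfying the SMP conditions.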

