Method of Successive Approximations for Stochastic Optimal Control: Contractivity and Convergence
Abstract: The Method of Successive Approximations (MSA) is a fixed-point iterative method used to solve stochastic optimal control problems. It is an indirect method based on the conditions derived from the Stochastic Maximum Principle (SMP), an extension of the Pontryagin Maximum Principle (PMP) to stochastic control problems. In this study, we investigate the contractivity and the convergence of MSA for a specific and interesting class of stochastic dynamical systems (when the drift coefficient is one-sided-Lipschitz with a negative constant and the diffusion coefficient is Lipschitz continuous). Our analysis unfolds in three key steps: firstly, we prove the stability of the state process with respect to the control process. Secondly, we establish the stability of the adjoint process. Finally, we present rigorous evidence to prove the contractivity and then the convergence of MSA. This study contributes to enhancing the understanding of MSA's applicability and effectiveness in addressing stochastic optimal control problems.
- Q. Li, L. Chen, C. Tai, and W. E, “Maximum principle based algorithms for deep learning,” Journal of Machine Learning Research, vol. 18, no. 165, pp. 1–29, 2018. [Online]. Available: http://jmlr.org/papers/v18/17-653.html
- K. D. Smith and F. Bullo, “Contractivity of the method of successive approximations for optimal control,” IEEE Control Systems Letters, vol. 7, pp. 919–924, 2022.
- H. Schurz, “A brief review on stability investigations of numerical methods for systems of stochastic differential equations,” Networks and Heterogeneous Media, vol. 19, no. 1, pp. 355–383, 2024. [Online]. Available: https://www.aimspress.com/article/doi/10.3934/nhm.2024016
- D. J. Higham, X. Mao, and A. M. Stuart, “Exponential mean-square stability of numerical solutions to stochastic differential equations,” LMS Journal of Computation and Mathematics, vol. 6, p. 297–313, 2003.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.