Convergence Analysis for Entropy-Regularized Control Problems: A Probabilistic Approach (2406.10959v4)

Published 16 Jun 2024 in math.OC and cs.LG

Abstract: In this paper we investigate the convergence of the Policy Iteration Algorithm (PIA) for a class of general continuous-time entropy-regularized stochastic control problems. In particular, instead of employing sophisticated PDE estimates for the iterative PDEs involved in the algorithm (see, e.g., Huang-Wang-Zhou (2025)), we provide a simple proof from scratch for the convergence of the PIA. Our approach builds on probabilistic representation formulae for solutions of PDEs and their derivatives. Moreover, in the finite horizon model and in the infinite horizon model with a large discount factor, similar arguments lead to a super-exponential rate of convergence without tears. Finally, with some extra effort we show that our approach can be extended to the diffusion control case in the one-dimensional setting, also with a super-exponential rate of convergence.
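
To make the structure of policy iteration with entropy regularization concrete, here is a minimal sketch on a toy problem. It is a discrete-time, finite-state analogue and not the continuous-time algorithm analyzed in the paper: it alternates policy evaluation with a Gibbs (softmax) policy update. The function name `soft_policy_iteration`, the two-state transition table `P`, the rewards `r`, the discount `gamma` and the temperature `lam` are all hypothetical choices made for illustration.

```python
import math


def soft_policy_iteration(P, r, gamma=0.9, lam=0.5, n_iter=20, eval_sweeps=500):
    """Entropy-regularized policy iteration on a finite MDP (illustrative only).

    P[s][a][t] is the probability of moving from state s to state t under
    action a; r[s][a] is the reward; lam is the entropy temperature.
    Returns the final policy and the value function of every iterate.
    """
    n_s, n_a = len(r), len(r[0])
    pi = [[1.0 / n_a] * n_a for _ in range(n_s)]  # start from the uniform policy
    history = []

    def q_values(V, s):
        # Action values for state s, without the entropy bonus.
        return [r[s][a] + gamma * sum(P[s][a][t] * V[t] for t in range(n_s))
                for a in range(n_a)]

    for _ in range(n_iter):
        # Policy evaluation: iterate the regularized Bellman operator for the
        # current policy (a gamma-contraction) until it is numerically converged.
        V = [0.0] * n_s
        for _ in range(eval_sweeps):
            new_V = []
            for s in range(n_s):
                q = q_values(V, s)
                entropy = -sum(p * math.log(p) for p in pi[s] if p > 0.0)
                new_V.append(sum(pi[s][a] * q[a] for a in range(n_a)) + lam * entropy)
            V = new_V
        history.append(V)

        # Policy improvement: Gibbs update pi(a|s) proportional to exp(Q(s,a)/lam).
        new_pi = []
        for s in range(n_s):
            q = q_values(V, s)
            m = max(q)  # subtract the max for numerical stability
            w = [math.exp((qa - m) / lam) for qa in q]
            z = sum(w)
            new_pi.append([wa / z for wa in w])
        pi = new_pi

    return pi, history


# Hypothetical two-state, two-action problem.
P = [[[0.9, 0.1], [0.2, 0.8]],
     [[0.7, 0.3], [0.1, 0.9]]]
r = [[1.0, 0.0],
     [0.0, 2.0]]

pi, history = soft_policy_iteration(P, r)
for n, V in enumerate(history[:5]):
    print(n, [round(v, 6) for v in V])
```

Printing the first few iterates shows how quickly successive value functions settle in this toy setting. The convergence rates stated in the abstract concern the continuous-time problem and are not established by this sketch.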