Convergence analysis of controlled particle systems arising in deep learning: from finite to infinite sample size (2404.05185v3)
Abstract: This paper deals with a class of neural SDEs and studies the limiting behavior of the associated sampled optimal control problems as the sample size grows to infinity. The neural SDEs with $N$ samples can be linked to the $N$-particle systems with centralized control. We analyze the Hamilton-Jacobi-BeLLMan equation corresponding to the $N$-particle system and establish regularity results which are uniform in $N$. The uniform regularity estimates are obtained by the stochastic maximum principle and the analysis of a backward stochastic Riccati equation. Using these uniform regularity results, we show the convergence of the minima of the objective functionals and optimal parameters of the neural SDEs as the sample size $N$ tends to infinity. The limiting objects can be identified with suitable functions defined on the Wasserstein space of Borel probability measures. Furthermore, quantitative convergence rates are also obtained.
- R. Carmona and M. Laurière. Deep learning for mean field games and mean field control with applications to finance. arXiv:2107.04568, 2021.
- L. Chizat and F. Bach. On the global convergence of gradient descent for over-parameterized models using optimal transport. 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), 2018.
- P. Henry-Labordère. Deep primal-dual algorithm for bsdes: Applications of machine learning to cva and im. SSRN: 3071506, 2017.