Papers
Topics
Authors
Recent
Search
2000 character limit reached

SOC-MartNet: A Martingale Neural Network for the Hamilton-Jacobi-Bellman Equation without Explicit inf H in Stochastic Optimal Controls

Published 6 May 2024 in math.NA and cs.NA | (2405.03169v4)

Abstract: In this paper, we propose a martingale-based neural network, SOC-MartNet, for solving high-dimensional Hamilton-Jacobi-Bellman (HJB) equations where no explicit expression is needed for the infimum of the Hamiltonian, $\inf_{u \in U} H(t,x,u, z,p)$, and stochastic optimal control problems (SOCPs) with controls on both drift and volatility. We reformulate the HJB equations for the value function by training two neural networks, one for the value function and one for the optimal control with the help of two stochastic processes - a Hamiltonian process and a cost process. The control and value networks are trained such that the associated Hamiltonian process is minimized to satisfy the minimum principle of a feedback SOCP, and the cost process becomes a martingale, thus, ensuring the value function network as the solution to the corresponding HJB equation. Moreover, to enforce the martingale property for the cost process, we employ an adversarial network and construct a loss function characterizing the projection property of the conditional expectation condition of the martingale. Numerical results show that the proposed SOC-MartNet is effective and efficient for solving HJB-type equations and SOCPs with a dimension up to 10,000 in a small number of iteration steps (less than 6000) of training.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (51)
  1. Deep neural networks algorithms for stochastic control problems on finite horizon: numerical applications. Methodol. Comput. Appl. Probab., 24(1):143–178, 2022.
  2. On the convergence rate of approximation schemes for Hamilton-Jacobi-Bellman equations. M2AN Math. Model. Numer. Anal., 36(1):33–54, 2002.
  3. Approximate solutions to the time-invariant Hamilton-Jacobi-Bellman equation. J. Optim. Theory Appl., 96(3):589–626, 1998.
  4. Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation. Automatica J. IFAC, 33(12):2159–2177, 1997.
  5. Richard Bellman. Dynamic programming. Princeton University Press, Princeton, NJ, 1957.
  6. A patchy dynamic programming scheme for a class of Hamilton-Jacobi-Bellman equations. SIAM J. Sci. Comput., 34(5):A2625–A2649, 2012.
  7. Wei Cai. DeepMartNet – a martingale based deep neural network learning algorithm for eigenvalue/BVP problems and optimal stochastic controls, 2023, arXiv:2307.11942 [math.NA].
  8. DeepMartNet – a martingale based deep neural network learning method for Dirichlet BVP and eigenvalue problems of elliptic pdes, 2023, arXiv:2311.09456 [math.NA].
  9. User’s guide to viscosity solutions of second order partial differential equations. Bull. Amer. Math. Soc. (N.S.), 27(1):1–67, 1992.
  10. Viscosity solutions of Hamilton-Jacobi equations. Trans. Amer. Math. Soc., 277(1):1–42, 1983.
  11. Overcoming the curse of dimensionality for some Hamilton-Jacobi partial differential equations via neural network architectures. Res. Math. Sci., 7(3):Paper No. 20, 50, 2020.
  12. On some neural network architectures that can represent viscosity solutions of certain high dimensional Hamilton-Jacobi partial differential equations. J. Comput. Phys., 425:Paper No. 109907, 16, 2021.
  13. Tensor decomposition methods for high-dimensional Hamilton-Jacobi-Bellman equations. SIAM J. Sci. Comput., 43(3):A1625–A1650, 2021.
  14. Deep learning-based numerical methods for high-dimensional parabolic partial differential equations and backward stochastic differential equations. Commun. Math. Stat., 5(4):349–380, 2017.
  15. Algorithms for solving high dimensional PDEs: from nonlinear Monte Carlo to machine learning. Nonlinearity, 35(1):278–310, 2022.
  16. Deterministic and stochastic optimal control, volume No. 1 of Applications of Mathematics. Springer-Verlag, Berlin-New York, 1975.
  17. Highly accurate numerical schemes for stochastic optimal control via FBSDEs. Numer. Math. Theory Methods Appl., 13(2):296–319, 2020.
  18. Jean-François Le Gall. Brownian Motion, Martingales, and Stochastic Calculus. Springer Cham, 2016.
  19. Failure-informed adaptive sampling for PINNs. SIAM J. Sci. Comput., 45(4):A1971–A1994, 2023.
  20. An efficient gradient projection method for stochastic optimal control problems. SIAM J. Numer. Anal., 55(6):2982–3005, 2017.
  21. Monte Carlo fPINNs: deep learning method for forward and inverse problems involving high dimensional fractional partial differential equations. Comput. Methods Appl. Mech. Engrg., 400:Paper No. 115523, 17, 2022.
  22. Deep learning approximation for stochastic control problems. Deep Reinforcement Learning Workshop, NIPS, 2016, arXiv:1611.07422 [cs.LG].
  23. Deep neural networks algorithms for stochastic control problems on finite horizon: convergence analysis. SIAM J. Numer. Anal., 59(1):525–557, 2021.
  24. Deep backward schemes for high-dimensional nonlinear PDEs. Math. Comp., 89(324):1547–1579, 2020.
  25. Hitoshi Ishii. On uniqueness and existence of viscosity solutions of fully nonlinear second-order elliptic PDEs. Comm. Pure Appl. Math., 42(1):15–45, 1989.
  26. Robert Jensen. The maximum principle for viscosity solutions of fully nonlinear second order partial differential equations. Arch. Rational Mech. Anal., 101(1):1–27, 1988.
  27. Solving stochastic optimal control problem via stochastic maximum principle with deep learning method. J. Sci. Comput., 93(1):Paper No. 30, 28, 2022.
  28. Polynomial approximation of high-dimensional Hamilton-Jacobi-Bellman equations and applications to feedback control of semilinear parabolic PDEs. SIAM J. Sci. Comput., 40(2):A629–A652, 2018.
  29. Mitigating the curse of dimensionality: sparse grid characteristics method for optimal feedback control and HJB equations. Comput. Optim. Appl., 68(2):289–315, 2017.
  30. A numerical algorithm for fully nonlinear HJB equations: an approach by control randomization. Monte Carlo Methods Appl., 20(2):145–165, 2014.
  31. Discrete time approximation of fully nonlinear HJB equations via BSDEs with nonpositive jumps. Ann. Appl. Probab., 25(4):2301–2338, 2015.
  32. Feynman-Kac representation for Hamilton-Jacobi-Bellman IPDE. Ann. Probab., 43(4):1823–1865, 2015.
  33. Achim Klenke. Probability Theory. Springer Cham, third edition, 2020.
  34. N. V. Krylov. Controlled diffusion processes, volume 14 of Applications of Mathematics. Springer-Verlag, New York-Berlin, 1980. Translated from the Russian by A. B. Aries.
  35. N. V. Krylov. Nonlinear elliptic and parabolic equations of the second order, volume 7 of Mathematics and its Applications (Soviet Series). D. Reidel Publishing Co., Dordrecht, 1987. Translated from the Russian by P. L. Buzytsky [P. L. Buzytskiĭ].
  36. HJB-POD-based feedback design for the optimal control of evolution problems. SIAM J. Appl. Dyn. Syst., 3(4):701–722, 2004.
  37. Pierre-Louis Lions. Generalized solutions of Hamilton-Jacobi equations, volume 69 of Research Notes in Mathematics. Pitman (Advanced Publishing Program), Boston, Mass.-London, 1982.
  38. Adaptive deep learning for high-dimensional Hamilton-Jacobi-Bellman equations. SIAM J. Sci. Comput., 43(2):A1221–A1247, 2021.
  39. Patchy solutions of Hamilton-Jacobi-Bellman partial differential equations. In Modeling, estimation and control, volume 364 of Lect. Notes Control Inf. Sci., pages 251–270. Springer, Berlin, 2007.
  40. Bernt Ø ksendal. Stochastic differential equations. Universitext. Springer-Verlag, Berlin, sixth edition, 2003. An introduction with applications.
  41. S Osher and CW Shu. High-order essentially nonoscillatory schemes for hamilton–jacobi equations. SIAM Journal on numerical analysis, 4:907–922, 1991.
  42. Huyên Pham. Continuous-time stochastic control and optimization with financial applications, volume 61 of Stochastic Modelling and Applied Probability. Springer-Verlag, Berlin, 2009.
  43. Physics-informed neural networks: a deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys., 378:686–707, 2019.
  44. S. Richardson and S. Wang. Numerical solution of Hamilton-Jacobi-Bellman equations by an exponentially fitted finite volume method. Optimization, 55(1-2):121–140, 2006.
  45. Discontinuous Galerkin finite element approximation of Hamilton-Jacobi-Bellman equations with Cordes coefficients. SIAM J. Numer. Anal., 52(2):993–1016, 2014.
  46. Numerical solution of Hamilton-Jacobi-Bellman equations by an upwind finite volume method. volume 27, pages 177–192. 2003. International Workshop on Optimization with High-Technology Applications (OHTA 2000) (Hong Kong).
  47. Stochastic controls, volume 43 of Applications of Mathematics (New York). Springer-Verlag, New York, 1999. Hamiltonian systems and HJB equations.
  48. Weak adversarial networks for high-dimensional partial differential equations. J. Comput. Phys., 411:109409, 14, 2020.
  49. FBSDE based neural network algorithms for high-dimensional quasilinear parabolic PDEs. J. Comput. Phys., 470:Paper No. 111557, 14, 2022.
  50. High order numerical schemes for second-order FBSDEs with applications to stochastic optimal control. Commun. Comput. Phys., 21(3):808–834, 2017.
  51. Actor-critic method for high dimensional static Hamilton-Jacobi-Bellman partial differential equations based on neural networks. SIAM J. Sci. Comput., 43(6):A4043–A4066, 2021.
Citations (1)

Summary

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Authors (3)

Collections

Sign up for free to add this paper to one or more collections.