Neural Network Approaches for Parameterized Optimal Control (2402.10033v1)
Abstract: We consider numerical approaches for deterministic, finite-dimensional optimal control problems whose dynamics depend on unknown or uncertain parameters. We seek to amortize the solution over a set of relevant parameters in an offline stage to enable rapid decision-making and be able to react to changes in the parameter in the online stage. To tackle the curse of dimensionality arising when the state and/or parameter are high-dimensional, we represent the policy using neural networks. We compare two training paradigms: First, our model-based approach leverages the dynamics and definition of the objective function to learn the value function of the parameterized optimal control problem and obtain the policy using a feedback form. Second, we use actor-critic reinforcement learning to approximate the policy in a data-driven way. Using an example involving a two-dimensional convection-diffusion equation, which features high-dimensional state and parameter spaces, we investigate the accuracy and efficiency of both training paradigms. While both paradigms lead to a reasonable approximation of the policy, the model-based approach is more accurate and considerably reduces the number of PDE solves.
- A survey of exploration methods in reinforcement learning. arXiv preprint arXiv:2109.00157, 2021.
- Numerical modeling of hemodynamics scenarios of patient-specific coronary artery bypass grafts. Biomechanics and Modeling in Mechanobiology, 16:1373–1399, 2017.
- DeepReach: A deep learning approach to high-dimensional reachability. In IEEE International Conference on Robotics and Automation (ICRA), pages 1817–1824, 2021.
- Dota 2 with large scale deep reinforcement learning. ArXiv, abs/1912.06680, 2019.
- Dimitri P. Bertsekas. Reinforcement learning and optimal control. Athena Scientific Optimization and Computation Series. Athena Scientific, Belmont, MA, [2019] ©2019. Second printing with editorial revisions.
- Optimal motorway traffic flow control involving variable speed limits and ramp metering. Transportation Science, 44(2):238–253, 2010.
- Top-k off-policy correction for a reinforce recommender system. Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, 2018.
- An extended physics informed neural network for preliminary analysis of parametric optimal control problems. arXiv:2110.13530, 2021.
- Controlled Markov Processes and Viscosity Solutions, volume 25 of Stochastic Modelling and Applied Probability. Springer, New York, second edition, 2006.
- Addressing function approximation error in actor-critic methods. In Jennifer Dy and Andreas Krause, editors, Proceedings of the 35th International Conference on Machine Learning, volume 80 of Proceedings of Machine Learning Research, pages 1587–1596. PMLR, 10–15 Jul 2018.
- Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2016.
- Deep reinforcement learning that matters. In Proceedings of the AAAI conference on artificial intelligence, volume 32, 2018.
- Solving PDE-constrained control problems using operator learning. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 36, pages 4504–4512, 2022.
- Learning agile and dynamic motor skills for legged robots. Science Robotics, 4, 2019.
- Alex Irpan. Deep reinforcement learning doesn’t work yet. 2018.
- Algorithms of data development for deep learning and feedback design. arXiv:1912.00492, 2019.
- Adam: A method for stochastic optimization. arXiv:1412.6980, 2014.
- Semiglobal optimal feedback stabilization of autonomous systems via deep neural network approximation. ESAIM: Control, Optimisation and Calculus of Variations, 27, 2021.
- Optimal feedback control of dynamical systems via value-function approximation. arXiv:2302.13122, 2023.
- A neural network approach for stochastic optimal control. arXiv:2209.13104, 2022.
- Solutions for multiagent pursuit-evasion games on communication graphs: Finite-time capture and asymptotic behaviors. IEEE Transactions on Automatic Control (TAC), 65(5):1911–1923, 2019.
- Physics-informed neural networks with hard constraints for inverse design. SIAM Journal on Scientific Computing, 43(6):B1105–B1132, 2021.
- Iterative surrogate model optimization (ISMO): an active learning algorithm for PDE constrained optimization with deep neural networks. Computer Methods in Applied Mechanics and Engineering, 374:113575, 2021.
- Playing atari with deep reinforcement learning. arXiv, 1312.5602, 2013.
- Optimal control of PDEs using physics-informed neural networks. Journal of Computational Physics, page 111731, 2022.
- Improving stability in deep reinforcement learning with weight averaging. In Uncertainty in artificial intelligence workshop on uncertainty in Deep learning, 2018.
- A neural network approach for high-dimensional optimal control. arXiv:2104.03270, 2021.
- Reduction strategies for PDE-constrained optimization problems in haemodynamics. In Proceedings of the 6th European Congress on Computational Methods in Applied Sciences and Engineering, number CONF, pages 1748–1769. Vienna Technical University, 2012.
- Proximal policy optimization algorithms. arXiv:1707.06347, 2017.
- Mastering the game of go with deep neural networks and tree search. Nature, 529:484–489, 2016.
- A general reinforcement learning algorithm that masters chess, shogi, and go through self-play. Science, 362:1140 – 1144, 2018.
- Model reduction for parametrized optimal control problems in environmental marine sciences and engineering. SIAM Journal on Scientific Computing, 40(4):B1055–B1079, 2018.
- Reinforcement learning: An introduction. MIT press, 2018.
- Fredi Tröltzsch. Optimal Control of Partial Differential Equations: Theory, Methods and Applications, volume 112 of Graduate Studies in Mathematics. 2010.
- Fast PDE-constrained optimization via self-supervised operator learning. arXiv:2110.13297, 2021.
- Learning to see physics via visual de-animation. In NIPS, 2017.
- Machine learning for adjoint vector in aerodynamic shape optimization. Acta Mechanica Sinica, pages 1–17, 2021.
- AONN: An adjoint-oriented neural network method for all-at-once solutions of parametric optimal control problems. arXiv:2302.02076, 2023.
- Stochastic controls, volume 43 of Applications of Mathematics (New York). Springer-Verlag, New York, 1999. Hamiltonian systems and HJB equations.
- Actor-critic method for high dimensional static Hamilton–Jacobi–Bellman partial differential equations based on neural networks. SIAM Journal on Scientific Computing, 43(6):A4043–A4066, jan 2021.
- Mo Zhou and Jianfeng Lu. A policy gradient framework for stochastic optimal control problems with global convergence guarantee. arXiv:2302.05816, 2023.
- Hao Dong Zihan Ding. Challenges of reinforcement learning. In Shanghang Zhang Hao Dong, Zihan Ding, editor, Deep Reinforcement Learning: Fundamentals, Research, and Applications, chapter 7, pages 249–272. Springer Nature, 2020. http://www.deepreinforcementlearningbook.org.
Sponsor
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.