On the Identification and Optimization of Nonsmooth Superposition Operators in Semilinear Elliptic PDEs (2306.05185v2)
Abstract: We study an infinite-dimensional optimization problem that aims to identify the Nemytskii operator in the nonlinear part of a prototypical semilinear elliptic partial differential equation (PDE) which minimizes the distance between the PDE-solution and a given desired state. In contrast to previous works, we consider this identification problem in a low-regularity regime in which the function inducing the Nemytskii operator is a-priori only known to be an element of $H1_{loc}(\mathbb{R})$. This makes the studied problem class a suitable point of departure for the rigorous analysis of training problems for learning-informed PDEs in which an unknown superposition operator is approximated by means of a neural network with nonsmooth activation functions (ReLU, leaky-ReLU, etc.). We establish that, despite the low regularity of the controls, it is possible to derive a classical stationarity system for local minimizers and to solve the considered problem by means of a gradient projection method. The convergence of the resulting algorithm is proven in the function space setting. It is also shown that the established first-order necessary optimality conditions imply that locally optimal superposition operators share various characteristic properties with commonly used activation functions: They are always sigmoidal, continuously differentiable away from the origin, and typically possess a distinct kink at zero. The paper concludes with numerical experiments which confirm the theoretical findings.
- Learning-informed parameter identification in nonlinear time-dependent PDEs. Appl. Math. Optim., 88(3):Paper No. 76, 53, 2023.
- Functions of Bounded Variation and Free Discontinuity Problems. Oxford University Press, Oxford & New York, 2000.
- Variational Analysis in Sobolev and BV Spaces. SIAM, Philadelphia, 2006.
- V. Barbu. Optimal Control of Variational Inequalities. Research Notes in Mathematics. Pitman, 1984.
- V. Barbu and K. Kunisch. Identification of nonlinear elliptic equations. Appl. Math. Optim., 33(2):139–167, 1996.
- A penalty method for the identification of nonlinear elliptic differential operator. Numer. Funct. Anal. Optim., 15(5-6):503–530, 1994.
- A. Beck. Introduction to Nonlinear Optimization. MOS/SIAM Series on Optimization. SIAM, 2014.
- V. I. Bogachev. Measure Theory. Springer, 2007.
- J. F. Bonnans and A. Shapiro. Perturbation Analysis of Optimization Problems. Springer Series in Operations Research. Springer, New York, 2000.
- Discovering governing equations from data by sparse identification of nonlinear dynamical systems. Proc. Nat. Acad. Sci., 113(15):3932–3937, 2016.
- C. Christof. Sensitivity Analysis of Elliptic Variational Inequalities of the First and the Second Kind. PhD thesis, Technische Universität Dortmund, 2018.
- C. Christof. Gradient-based solution algorithms for a class of bilevel optimization and optimal control problems with a nonsmooth lower level. SIAM J. Optim., 30(1):290–318, 2020.
- C. Christof and J. Kowalczyk. On the omnipresence of spurious local minima in certain neural network training problems. Constr. Approx, 2023. to appear.
- Optimal control of a non-smooth semilinear elliptic equation. Math. Control Relat. Fields, 8(1):247–276, 2018.
- C. Christof and G. Müller. Multiobjective optimal control of a non-smooth semilinear elliptic partial differential equation. ESAIM Control Optim. Calc. Var., 27, 2021. Art. S13.
- No-gap second-order optimality conditions for optimal control of a non-smooth quasilinear elliptic equation. ESAIM Control Optim. Calc. Var., 27, 2021. Art. 62.
- On acceleration with noise-corrupted gradients. In Proceedings of the 35th International Conference on Machine Learning (PMLR 80), pages 1019–1028, Stockholm, Sweden, 2018.
- S. Court and K. Kunisch. Design of the monodomain model by artificial neural networks. Discrete Contin. Dyn. Syst., 42(12):6031–6061, 2022.
- A descent algorithm for the optimal control of ReLU neural network informed PDEs based on approximate directional derivatives. arXiv:2210.07900v1, 2022.
- Optimization with learning-informed differential equation constraints and its applications. ESAIM Control Optim. Calc. Var., 28:3, 2022.
- First-order conditions for the optimal control of learning-informed nonsmooth PDEs. arXiv:2206.00297v2, 2022.
- P. Drábek and J. Milota. Methods of Nonlinear Analysis: Applications to Differential Equations. Birkhäuser Verlag, 2007.
- D. Gilbarg and N. S. Trudinger. Elliptic Partial Differential Equations of Second Order. Springer, 2001.
- M. Goebel. Smooth and nonsmooth optimal Lipschitz control — a model problem. In W. H. Schmidt, K. Heier, L. Bittner, and R. Bulirsch, editors, Variational Calculus, Optimal Control and Applications, pages 53–60, Basel, 1998. Birkhäuser Basel.
- P. Grisvard. Elliptic Problems in Nonsmooth Domains. Pitman, 1985.
- S. Grützner and A. Muntean. Identifying processes governing damage evolution in quasi-static elasticity part 1 – analysis. Adv. Math. Sci. Appl., 30(2):305–334, 2021.
- Identification of nonlinear heat transfer laws from boundary observations. Appl. Anal., 94(9):1784–1799, 2015.
- L. Hertlein and M. Ulbrich. An inexact bundle algorithm for nonconvex nonsmooth minimization in Hilbert space. SIAM J. Control Optim., 57(5):3137–3165, 2019.
- M. Hinze and A. Rösch. Discretization of optimal control problems. In G. Leugering, S. Engell, A. Griewank, M. Hinze, R. Rannacher, V. Schulz, M. Ulbrich, and S. Ulbrich, editors, Constrained Optimization and Optimal Control for Partial Differential Equations, pages 391–430, Basel, 2012. Springer Basel.
- M. Josephy. Composing functions of bounded variation. Proc. Amer. Math. Soc, 83(2):354–356, 1981.
- B. Kaltenbacher and T. T. N. Nguyen. Discretization of parameter identification in PDEs using neural networks. Inverse Problems, 38(12):124007, 2022.
- B. Kaltenbacher and W. Rundell. The inverse problem of reconstructing reaction-diffusion systems. Inverse Problems, 36(6):065011, 2020.
- Y. Kian. Lipschitz and Hölder stable determination of nonlinear terms for elliptic equations. Nonlinearity, 36(2):1302, 2023.
- D. Kinderlehrer and G. Stampacchia. An Introduction to Variational Inequalities and Their Applications, volume 31 of Classics in Applied Mathematics. SIAM, 2000.
- B. S. Mityagin. The zero set of a real analytic function. Math. Notes, 107(3):529–530, 2020.
- Kurzweil-Stieltjes Integral: Theory and Applications. Number 15 in Series in Real Analysis. World Scientific, Singapore, 2019.
- Data driven governing equations approximation using deep neural networks. J. Comput. Phys., 395:620–635, 2019.
- A. Rösch. Identification of nonlinear heat transfer laws by optimal control. Num. Funct. Anal. Optim., 15(3-4):417–434, 1994.
- A. Rösch. Fréchet differentiability of the solution of the heat equation with respect to a nonlinear boundary condition. Z. Anal. Anwend., 15(3):603–618, 1996.
- A. Rösch. Second order optimality conditions and stability estimates for the identification of nonlinear heat transfer laws. In W. Desch, F. Kappel, and K. Kunisch, editors, Control and Estimation of Distributed Parameter Systems, pages 237–246, Basel, 1998. Birkhäuser Basel.
- A. Rösch. A Gauss-Newton method for the identification of nonlinear heat transfer laws. In K.-H. Hoffmann, I. Lasiecka, G. Leugering, J. Sprekels, and F. Tröltzsch, editors, Optimal Control of Complex Structures, pages 217–230, Basel, 2002. Birkhäuser Basel.
- A. Rösch and F. Tröltzsch. An optimal control problem arising from the identification of nonlinear heat transfer laws. Arch. Control Sci., 1:4–183, 1992.
- Data-driven discovery of partial differential equations. Sci. Adv., 3(4):e1602614, 2017.
- A. Rösch. Stability estimates for the identification of nonlinear heat transfer laws. Inverse Problems, 12(5):743–756, 1996.
- B. Schweizer. Partielle Differentialgleichungen. Springer-Verlag, Berlin/Heidelberg, 2013.
- G. Stadler. Elliptic optimal control problems with L1-control cost and applications for the placement of control devices. Comput. Optim. Appl., 44(159), 2009.
- F. Tröltzsch. Optimal Control of Partial Differential Equations. AMS, 2010.
- W. P. Ziemer. Weakly Differentiable Functions. Springer Verlag, New York, 1989.