2000 character limit reached
Continuous limits of residual neural networks in case of large input data (2112.14150v2)
Published 28 Dec 2021 in math.AP, cs.NA, math.NA, and math.OC
Abstract: Residual deep neural networks (ResNets) are mathematically described as interacting particle systems. In the case of infinitely many layers the ResNet leads to a system of coupled system of ordinary differential equations known as neural differential equations. For large scale input data we derive a mean--field limit and show well--posedness of the resulting description. Further, we analyze the existence of solutions to the training process by using both a controllability and an optimal control point of view. Numerical investigations based on the solution of a formal optimality system illustrate the theoretical findings.
- M. Herty (13 papers)
- A. Thuenen (1 paper)
- T. Trimborn (3 papers)
- G. Visconti (9 papers)