Continuous limits of residual neural networks in case of large input data (2112.14150v2)

Published 28 Dec 2021 in math.AP, cs.NA, math.NA, and math.OC

Abstract: Residual deep neural networks (ResNets) are mathematically described as interacting particle systems. In the case of infinitely many layers, the ResNet leads to a coupled system of ordinary differential equations known as neural differential equations. For large-scale input data we derive a mean-field limit and show well-posedness of the resulting description. Further, we analyze the existence of solutions to the training process by using both a controllability and an optimal control point of view. Numerical investigations based on the solution of a formal optimality system illustrate the theoretical findings.

Authors (4)

M. Herty
A. Thuenen
T. Trimborn
G. Visconti

