Gaussian Universality in Neural Network Dynamics with Generalized Structured Input Distributions (2405.00642v3)

Published 1 May 2024 in stat.ML, cond-mat.dis-nn, cond-mat.stat-mech, and cs.LG

Abstract: Bridging the gap between the practical performance of deep learning and its theoretical foundations often involves analyzing neural networks through stochastic gradient descent (SGD). Expanding on previous research that modeled structured inputs under a simple Gaussian setting, we analyze the behavior of a deep learning system trained on inputs modeled as Gaussian mixtures, which better approximate more general structured inputs. Through empirical analysis and theoretical investigation, we demonstrate that under certain standardization schemes, the deep learning model converges toward the behavior observed in the Gaussian setting, even when the input data follow more complex or real-world distributions. This finding exhibits a form of universality in which diverse structured distributions yield results consistent with Gaussian assumptions, which can support the theoretical understanding of deep learning models.
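
To make the setup in the abstract concrete, here is a minimal sketch of drawing inputs from a Gaussian mixture and standardizing them. The abstract does not say which standardization schemes the paper uses, so the per-coordinate shift to zero mean and unit variance below is an assumption, as are the function names, mixture parameters, and sample size.

```python
import numpy as np

def sample_gaussian_mixture(n, means, stds, weights, rng):
    """Draw n points from a mixture of isotropic Gaussians (illustrative setup)."""
    means = np.asarray(means, dtype=float)   # shape (k, d): one mean per component
    stds = np.asarray(stds, dtype=float)     # shape (k,): one scale per component
    # Pick a mixture component for every sample according to the weights.
    comp = rng.choice(len(weights), size=n, p=weights)
    noise = rng.standard_normal((n, means.shape[1]))
    return means[comp] + stds[comp, None] * noise

def standardize(x):
    """Assumed scheme: shift and scale each coordinate to zero mean, unit variance."""
    return (x - x.mean(axis=0)) / x.std(axis=0)

rng = np.random.default_rng(0)
x = sample_gaussian_mixture(
    n=10_000,
    means=[[-2.0, 0.0], [3.0, 1.0]],  # hypothetical component means
    stds=[0.5, 1.5],                  # hypothetical component scales
    weights=[0.3, 0.7],               # hypothetical mixing weights
    rng=rng,
)
z = standardize(x)
```

After this step, `z` matches a standard Gaussian in its first two per-coordinate moments while remaining non-Gaussian in shape; the abstract's universality claim concerns how a network trained on such inputs behaves, which this sketch does not test.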
