Sparsity-depth Tradeoff in Infinitely Wide Deep Neural Networks (2305.10550v1)

Published 17 May 2023 in cs.LG, cond-mat.dis-nn, and q-bio.NC

Abstract: We investigate how sparse neural activity affects the generalization performance of a deep Bayesian neural network at the large width limit. To this end, we derive a neural network Gaussian Process (NNGP) kernel with rectified linear unit (ReLU) activation and a predetermined fraction of active neurons. Using the NNGP kernel, we observe that the sparser networks outperform the non-sparse networks at shallow depths on a variety of datasets. We validate this observation by extending the existing theory on the generalization error of kernel-ridge regression.

Citations (2)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Related Papers

Convolutional Deep Kernel Machines (2023)
Finite Versus Infinite Neural Networks: an Empirical Study (2020)
Guided Deep Kernel Learning (2023)
Fast Neural Kernel Embeddings for General Activations (2022)
Wide Neural Networks with Bottlenecks are Deep Gaussian Processes (2020)