
Getting Away with More Network Pruning: From Sparsity to Geometry and Linear Regions (2301.07966v1)

Published 19 Jan 2023 in cs.LG and math.OC

Abstract: One surprising trait of neural networks is the extent to which their connections can be pruned with little to no effect on accuracy. But when we cross a critical level of parameter sparsity, pruning any further leads to a sudden drop in accuracy. This drop plausibly reflects a loss in model complexity, which we aim to avoid. In this work, we explore how sparsity also affects the geometry of the linear regions defined by a neural network, and consequently reduces the expected maximum number of linear regions based on the architecture. We observe that pruning affects accuracy similarly to how sparsity affects the number of linear regions and our proposed bound for the maximum number. Conversely, we find that selecting the sparsity of each layer to maximize our bound very often improves accuracy in comparison to pruning with the same sparsity in all layers, thereby providing guidance on where to prune.
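
The contrast the abstract draws, uniform sparsity everywhere versus a per-layer allocation, can be illustrated with a small sketch. The snippet below shows layerwise magnitude pruning on toy weight matrices; the `prune_layer` helper and the non-uniform rates in `per_layer` are illustrative placeholders, not the paper's bound-maximizing algorithm or its actual allocations.

```python
import numpy as np

def prune_layer(weights: np.ndarray, sparsity: float) -> np.ndarray:
    """Zero out the smallest-magnitude fraction `sparsity` of the weights."""
    k = int(np.floor(sparsity * weights.size))
    if k == 0:
        return weights.copy()
    # Threshold at the k-th smallest magnitude; entries at or below it are removed.
    threshold = np.partition(np.abs(weights).ravel(), k - 1)[k - 1]
    pruned = weights.copy()
    pruned[np.abs(pruned) <= threshold] = 0.0
    return pruned

rng = np.random.default_rng(0)
# Toy 3-layer MLP weight matrices (input dim 16, hidden widths 32 and 32, 10 outputs).
layers = [rng.standard_normal(s) for s in [(16, 32), (32, 32), (32, 10)]]

# Uniform pruning: the same sparsity in every layer.
uniform = [prune_layer(W, 0.8) for W in layers]

# Non-uniform pruning: hypothetical per-layer rates with roughly the same
# overall parameter budget, standing in for a bound-maximizing allocation.
per_layer = [0.7, 0.85, 0.75]
nonuniform = [prune_layer(W, s) for W, s in zip(layers, per_layer)]

for name, pruned in [("uniform", uniform), ("non-uniform", nonuniform)]:
    total = sum(W.size for W in pruned)
    zeros = sum(int((W == 0.0).sum()) for W in pruned)
    print(f"{name}: overall sparsity = {zeros / total:.3f}")
```

Under the paper's approach, the per-layer rates would instead be chosen to maximize its bound on the number of linear regions; the sketch only makes concrete what "same total sparsity, different layerwise distribution" means.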

Authors (8)
  1. Junyang Cai
  2. Khai-Nguyen Nguyen
  3. Nishant Shrestha
  4. Aidan Good
  5. Ruisen Tu
  6. Xin Yu
  7. Shandian Zhe
  8. Thiago Serra
Citations (6)