
Manipulating Sparse Double Descent (2401.10686v1)

Published 19 Jan 2024 in cs.LG

Abstract: This paper investigates the double descent phenomenon in two-layer neural networks, focusing on the role of L1 regularization and representation dimensions. It explores an alternative double descent phenomenon, named sparse double descent. The study emphasizes the complex relationship between model complexity, sparsity, and generalization, and suggests further research into more diverse models and datasets. The findings contribute to a deeper understanding of neural network training and optimization.
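The setup described in the abstract, a two-layer network whose loss includes an L1 penalty so that sparsity interacts with representation dimension, can be illustrated with a short sketch. This is a minimal, assumed example (hidden width, penalty strength, and MNIST-sized inputs are illustrative choices, not the paper's exact configuration); sweeping the width and the L1 coefficient is how one would trace out a sparse double descent curve in practice.

```python
# Minimal sketch (assumed configuration, not the paper's exact setup):
# a two-layer network trained with an L1 penalty added to the loss,
# which drives many weights toward zero as the penalty grows.
import torch
import torch.nn as nn

hidden_width = 512   # representation dimension (illustrative value)
lambda_l1 = 1e-4     # L1 regularization strength (illustrative value)

model = nn.Sequential(
    nn.Flatten(),
    nn.Linear(28 * 28, hidden_width),  # MNIST-sized inputs assumed
    nn.ReLU(),
    nn.Linear(hidden_width, 10),
)
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

def training_step(x, y):
    """One SGD step on a batch, with an explicit L1 penalty on all weights."""
    optimizer.zero_grad()
    logits = model(x)
    data_loss = criterion(logits, y)
    # The L1 term encourages sparsity; varying hidden_width and lambda_l1
    # is the kind of sweep used to study (sparse) double descent.
    l1_penalty = sum(p.abs().sum() for p in model.parameters())
    loss = data_loss + lambda_l1 * l1_penalty
    loss.backward()
    optimizer.step()
    return loss.item()
```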

Authors (1)
  1. Ya Shi Zhang (3 papers)

