Learning Functions: When Is Deep Better Than Shallow (1603.00988v4)

Published 3 Mar 2016 in cs.LG

Abstract: While the universal approximation property holds for both hierarchical and shallow networks, we prove that deep (hierarchical) networks can approximate the class of compositional functions with the same accuracy as shallow networks but with an exponentially lower number of training parameters and VC-dimension. This theorem settles an old conjecture by Bengio on the role of depth in networks. We then define a general class of scalable, shift-invariant algorithms to show a simple and natural set of requirements that justify deep convolutional networks.
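
To make the abstract's notion of a compositional function concrete, the sketch below (an illustration, not code from the paper) builds a function of d = 8 variables as a binary tree of bivariate constituent functions and compares the parameter count of a deep network whose layout mirrors that tree with that of a single generic shallow network. The constituent function, the per-node unit budgets, and the eps^(-2) versus eps^(-d) scalings are simplifying assumptions meant only to echo the theorem's message; the exact exponents in the paper depend on the smoothness class of the constituent functions.

```python
# Minimal sketch, assuming a binary-tree compositional target on d = 8 inputs.
# Not the paper's construction; an illustration of the parameter-count contrast
# between a hierarchical (deep) network and a generic shallow network.

import math


def constituent(a, b):
    """A hypothetical bivariate constituent function h(a, b)."""
    return math.tanh(a + b * b)


def compositional_target(x):
    """f(x1..x8) = h(h(h(x1,x2), h(x3,x4)), h(h(x5,x6), h(x7,x8)))."""
    assert len(x) == 8
    l1 = [constituent(x[i], x[i + 1]) for i in range(0, 8, 2)]    # 4 tree nodes
    l2 = [constituent(l1[i], l1[i + 1]) for i in range(0, 4, 2)]  # 2 tree nodes
    return constituent(l2[0], l2[1])                              # root node


def hierarchical_param_count(d, units_per_node):
    """Parameters of a deep net whose binary-tree layout mirrors the target.

    Each node is a small shallow net with 2 inputs, n hidden units and 1 output:
    3n hidden-layer parameters plus n + 1 output parameters = 4n + 1.
    A binary tree over d leaves has d - 1 internal nodes, so the total is linear in d.
    """
    return (d - 1) * (4 * units_per_node + 1)


def shallow_param_count(d, units):
    """Parameters of one generic shallow net with d inputs and `units` hidden units."""
    return (d + 1) * units + (units + 1)


if __name__ == "__main__":
    d, eps = 8, 0.1
    # Informal message of the theorem: for accuracy eps, the deep mirror of the
    # tree needs on the order of (d - 1) * eps^(-2) units overall, while a shallow
    # net needs on the order of eps^(-d) units, i.e. exponential in d.
    # The budgets below are illustrative, not the paper's exact constants.
    units_per_node = math.ceil(eps ** -2)   # per-node budget, deep case
    shallow_units = math.ceil(eps ** -d)    # total budget, shallow case
    print("sample value:", compositional_target([0.1 * i for i in range(8)]))
    print("deep (hierarchical) parameters:", hierarchical_param_count(d, units_per_node))
    print("shallow parameters:", shallow_param_count(d, shallow_units))
```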

Citations (142)