To Compress or Not to Compress- Self-Supervised Learning and Information Theory: A Review (2304.09355v5)

Published 19 Apr 2023 in cs.LG, cs.IT, and math.IT

Abstract: Deep neural networks excel in supervised learning tasks but are constrained by the need for extensive labeled data. Self-supervised learning emerges as a promising alternative, allowing models to learn without explicit labels. Information theory, and notably the information bottleneck principle, has been pivotal in shaping deep neural networks. This principle focuses on optimizing the trade-off between compression and preserving relevant information, providing a foundation for efficient network design in supervised contexts. However, its precise role and adaptation in self-supervised learning remain unclear. In this work, we scrutinize various self-supervised learning approaches from an information-theoretic perspective, introducing a unified framework that encapsulates the \textit{self-supervised information-theoretic learning problem}. We weave together existing research into a cohesive narrative, delve into contemporary self-supervised methodologies, and spotlight potential research avenues and inherent challenges. Additionally, we discuss the empirical evaluation of information-theoretic quantities and their estimation methods. Overall, this paper furnishes an exhaustive review of the intersection of information theory, self-supervised learning, and deep neural networks.

Citations (56)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/123543935/status/1695864115745399185

https://twitter.com/bloodbatmcgrath/status/1832772496543600805

https://twitter.com/fabmilo/status/1795655875690115581

To Compress or Not to Compress- Self-Supervised Learning and Information Theory: A Review (2304.09355v5)

Summary

Related Papers

Tweets