Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Information Bottleneck Revisited: Posterior Probability Perspective with Optimal Transport (2308.11296v1)

Published 22 Aug 2023 in cs.IT and math.IT

Abstract: Information bottleneck (IB) is a paradigm to extract information in one target random variable from another relevant random variable, which has aroused great interest due to its potential to explain deep neural networks in terms of information compression and prediction. Despite its great importance, finding the optimal bottleneck variable involves a difficult nonconvex optimization problem due to the nonconvexity of mutual information constraint. The Blahut-Arimoto algorithm and its variants provide an approach by considering its Lagrangian with fixed Lagrange multiplier. However, only the strictly concave IB curve can be fully obtained by the BA algorithm, which strongly limits its application in machine learning and related fields, as strict concavity cannot be guaranteed in those problems. To overcome the above difficulty, we derive an entropy regularized optimal transport (OT) model for IB problem from a posterior probability perspective. Correspondingly, we use the alternating optimization procedure and generalize the Sinkhorn algorithm to solve the above OT model. The effectiveness and efficiency of our approach are demonstrated via numerical experiments.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Lingyi Chen (7 papers)
  2. Shitong Wu (11 papers)
  3. Wenhao Ye (10 papers)
  4. Huihui Wu (16 papers)
  5. Hao Wu (623 papers)
  6. Wenyi Zhang (82 papers)
  7. Bo Bai (71 papers)
  8. Yining Sun (8 papers)
Citations (5)

Summary

We haven't generated a summary for this paper yet.