An Efficient Difference-of-Convex Solver for Privacy Funnel (2403.04778v3)
Abstract: We propose an efficient solver for the privacy funnel (PF) method that leverages its difference-of-convex (DC) structure. The proposed DC separation yields a closed-form update equation, which allows straightforward application to both known and unknown distribution settings. For the known-distribution case, we prove that the proposed non-greedy solver converges to local stationary points, and we empirically show that it outperforms state-of-the-art approaches in characterizing the privacy-utility trade-off. The insights of our DC approach carry over to unknown-distribution settings where labeled empirical samples are available instead. Building on these insights, our alternating minimization solver satisfies the fundamental Markov relation of PF, in contrast to previous variational inference-based solvers. Empirically, we evaluate the proposed solver on the MNIST and Fashion-MNIST datasets. Our results show that, at comparable reconstruction quality, an adversary suffers a higher prediction error when clustering our compressed codes than when clustering those of the compared methods. Most importantly, unlike the baselines, our solver is independent of private information in the inference phase.
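To make the known-distribution setting concrete, the following is a minimal sketch of how the two quantities traded off in PF could be evaluated for a fixed encoder. It assumes the common formulation in which X is the public data, Y the private information, and Z the released code, with Z depending on Y only through X (the Markov relation the abstract refers to), and a Lagrangian of the form I(Z;Y) − β·I(Z;X). The function names, the trade-off parameter `beta`, and the toy distribution are illustrative assumptions. This is not the paper's DC solver or its update equation; it only evaluates the objective.

```python
import numpy as np


def mutual_information(joint):
    """Mutual information (in nats) of a 2-D joint probability table."""
    joint = np.asarray(joint, dtype=float)
    row = joint.sum(axis=1, keepdims=True)   # marginal of the first variable
    col = joint.sum(axis=0, keepdims=True)   # marginal of the second variable
    prod = row * col                         # product of marginals, same shape as joint
    mask = joint > 0                         # skip zero cells (0 * log 0 = 0)
    return float(np.sum(joint[mask] * np.log(joint[mask] / prod[mask])))


def pf_lagrangian(p_xy, enc, beta):
    """Evaluate I(Z;Y) - beta * I(Z;X) for a fixed encoder.

    p_xy[x, y] is the known joint distribution of public X and private Y.
    enc[z, x]  is the encoder p(z|x); each column sums to one.
    beta       is a hypothetical trade-off weight (an assumption of this sketch).
    """
    p_xy = np.asarray(p_xy, dtype=float)
    enc = np.asarray(enc, dtype=float)
    p_x = p_xy.sum(axis=1)
    p_zx = enc * p_x[None, :]   # p(z, x) = p(z|x) p(x)
    p_zy = enc @ p_xy           # p(z, y) = sum_x p(z|x) p(x, y), using Y - X - Z
    i_zx = mutual_information(p_zx)
    i_zy = mutual_information(p_zy)
    return i_zy - beta * i_zx, i_zx, i_zy


# Toy example: binary X and Y that are positively correlated.
p_xy = np.array([[0.4, 0.1],
                 [0.1, 0.4]])
identity_enc = np.eye(2)          # Z = X: maximal utility, maximal leakage
constant_enc = np.ones((1, 2))    # Z constant: no utility, no leakage

loss_id, i_zx_id, i_zy_id = pf_lagrangian(p_xy, identity_enc, beta=0.5)
loss_c, i_zx_c, i_zy_c = pf_lagrangian(p_xy, constant_enc, beta=0.5)
```

The two encoders mark the extremes of the privacy-utility trade-off: releasing X unchanged leaks I(X;Y) about the private variable, while a constant code leaks nothing and conveys nothing. Because p(z, y) is computed only from p(z|x) and p(x, y), the encoder never needs Y at inference time.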