Privacy-Preserving Student Learning with Differentially Private Data-Free Distillation (2409.12384v1)

Published 19 Sep 2024 in cs.LG, cs.AI, cs.CR, and cs.CV

Abstract: Deep learning models can achieve high inference accuracy by extracting rich knowledge from massive well-annotated data, but may pose the risk of data privacy leakage in practical deployment. In this paper, we present an effective teacher-student learning approach to train privacy-preserving deep learning models via differentially private data-free distillation. The main idea is generating synthetic data to learn a student that can mimic the ability of a teacher well-trained on private data. In the approach, a generator is first pretrained in a data-free manner by incorporating the teacher as a fixed discriminator. With the generator, massive synthetic data can be generated for model training without exposing data privacy. Then, the synthetic data is fed into the teacher to generate private labels. Towards this end, we propose a label differential privacy algorithm termed selective randomized response to protect the label information. Finally, a student is trained on the synthetic data with the supervision of private labels. In this way, both data privacy and label privacy are well protected in a unified framework, leading to privacy-preserving models. Extensive experiments and analysis clearly demonstrate the effectiveness of our approach.

Authors (7)

Bochao Liu (12 papers)
Jianghu Lu (2 papers)
Pengju Wang (19 papers)
Junjie Zhang (79 papers)
Dan Zeng (54 papers)
Zhenxing Qian (54 papers)
Shiming Ge (47 papers)

Citations (1)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/Kokingkoal/status/1837047457823232241

Privacy-Preserving Student Learning with Differentially Private Data-Free Distillation (2409.12384v1)

Summary

Related Papers

Tweets