Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Large-Scale Public Data Improves Differentially Private Image Generation Quality (2309.00008v1)

Published 4 Aug 2023 in cs.CV, cs.CR, and cs.LG

Abstract: Public data has been frequently used to improve the privacy-accuracy trade-off of differentially private machine learning, but prior work largely assumes that this data come from the same distribution as the private. In this work, we look at how to use generic large-scale public data to improve the quality of differentially private image generation in Generative Adversarial Networks (GANs), and provide an improved method that uses public data effectively. Our method works under the assumption that the support of the public data distribution contains the support of the private; an example of this is when the public data come from a general-purpose internet-scale image source, while the private data consist of images of a specific type. Detailed evaluations show that our method achieves SOTA in terms of FID score and other metrics compared with existing methods that use public data, and can generate high-quality, photo-realistic images in a differentially private manner.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Ruihan Wu (23 papers)
  2. Chuan Guo (77 papers)
  3. Kamalika Chaudhuri (122 papers)
Citations (2)
Youtube Logo Streamline Icon: https://streamlinehq.com