Fair Sampling in Diffusion Models through Switching Mechanism (2401.03140v5)
Abstract: Diffusion models have shown their effectiveness in generation tasks by well-approximating the underlying probability distribution. However, diffusion models are known to suffer from an amplified inherent bias from the training data in terms of fairness. While the sampling process of diffusion models can be controlled by conditional guidance, previous works have attempted to find empirical guidance to achieve quantitative fairness. To address this limitation, we propose a fairness-aware sampling method called \textit{attribute switching} mechanism for diffusion models. Without additional training, the proposed sampling can obfuscate sensitive attributes in generated data without relying on classifiers. We mathematically prove and experimentally demonstrate the effectiveness of the proposed method on two key aspects: (i) the generation of fair data and (ii) the preservation of the utility of the generated data.
- Uncovering and mitigating algorithmic bias through learned latent structure. In Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society, 289–295.
- Anderson, B. D. 1982. Reverse-time diffusion equation models. Stochastic Processes and their Applications, 12(3): 313–326.
- ediffi: Text-to-image diffusion models with an ensemble of expert denoisers. arXiv preprint arXiv:2211.01324.
- Align your latents: High-resolution video synthesis with latent diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 22563–22575.
- Sega: Instructing diffusion using semantic dimensions. arXiv preprint arXiv:2301.12247.
- Perception prioritized training of diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 11472–11481.
- Fair generative modeling via weak supervision. In International Conference on Machine Learning, 1887–1898. PMLR.
- Diffedit: Diffusion-based semantic image editing with mask guidance. arXiv preprint arXiv:2210.11427.
- Diffusion models beat gans on image synthesis. Advances in neural information processing systems, 34: 8780–8794.
- Fairness through awareness. In Proceedings of the 3rd innovations in theoretical computer science conference, 214–226.
- Certifying and removing disparate impact. In proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining, 259–268.
- Fair diffusion: Instructing text-to-image generation models on fairness. arXiv preprint arXiv:2302.10893.
- Equality of opportunity in supervised learning. Advances in neural information processing systems, 29.
- Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, 770–778.
- Denoising diffusion probabilistic models. Advances in neural information processing systems, 33: 6840–6851.
- Classifier-Free Diffusion Guidance. In NeurIPS 2021 Workshop on Deep Generative Models and Downstream Applications.
- Polarity sampling: Quality and diversity control of pre-trained generative networks via singular values. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 10641–10650.
- Adversarial examples are not bugs, they are features. Advances in neural information processing systems, 32.
- Fairface: Face attribute dataset for balanced race, gender, and age for bias measurement and mitigation. In Proceedings of the IEEE/CVF winter conference on applications of computer vision, 1548–1558.
- FairFace: Face Attribute Dataset for Balanced Race, Gender, and Age for Bias Measurement and Mitigation. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 1548–1558.
- Elucidating the design space of diffusion-based generative models. Advances in Neural Information Processing Systems, 35: 26565–26577.
- On the fairness of generative adversarial networks (gans). In 2021 International Conference” Nonlinearity, Information and Robotics”(NIR), 1–7. IEEE.
- Variational diffusion models. Advances in neural information processing systems, 34: 21696–21707.
- DiffWave: A Versatile Diffusion Model for Audio Synthesis. In International Conference on Learning Representations.
- Learning multiple layers of features from tiny images.
- Diffusion Models Already Have A Semantic Latent Space. In The Eleventh International Conference on Learning Representations.
- Deep Learning Face Attributes in the Wild. In Proceedings of International Conference on Computer Vision (ICCV).
- Large-scale celebfaces attributes (celeba) dataset. Retrieved August, 15(2018): 11.
- Learning transferable features with deep adaptation networks. In International conference on machine learning, 97–105. PMLR.
- Dpm-solver: A fast ode solver for diffusion probabilistic model sampling in around 10 steps. Advances in Neural Information Processing Systems, 35: 5775–5787.
- Hierarchical text-conditional image generation with clip latents. arXiv preprint arXiv:2204.06125.
- High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 10684–10695.
- Fairness GAN: Generating datasets with fairness properties using a generative adversarial network. IBM Journal of Research and Development, 63(4/5): 3–1.
- D2c: Diffusion-decoding models for few-shot conditional generation. Advances in Neural Information Processing Systems, 34: 12533–12548.
- Denoising Diffusion Implicit Models. In International Conference on Learning Representations.
- Score-Based Generative Modeling through Stochastic Differential Equations. In International Conference on Learning Representations.
- Fairness definitions explained. In Proceedings of the international workshop on software fairness, 1–7.
- Diffusers: State-of-the-art diffusion models. https://github.com/huggingface/diffusers.
- Fairgan: Fairness-aware generative adversarial networks. In 2018 IEEE International Conference on Big Data (Big Data), 570–575. IEEE.
- Fairgan+: Achieving fair data generation and classification through generative adversarial nets. In 2019 IEEE International Conference on Big Data (Big Data), 1401–1406. IEEE.
- Yujin Choi (10 papers)
- Jinseong Park (13 papers)
- Hoki Kim (11 papers)
- Jaewook Lee (44 papers)
- Saerom Park (3 papers)