Stable Diffusion Exposed: Gender Bias from Prompt to Image (2312.03027v2)
Abstract: Several studies have raised awareness about social biases in image generative models, demonstrating their predisposition towards stereotypes and imbalances. This paper contributes to this growing body of research by introducing an evaluation protocol that analyzes the impact of gender indicators at every step of the generation process on Stable Diffusion images. Leveraging insights from prior work, we explore how gender indicators not only affect gender presentation but also the representation of objects and layouts within the generated images. Our findings include the existence of differences in the depiction of objects, such as instruments tailored for specific genders, and shifts in overall layouts. We also reveal that neutral prompts tend to produce images more aligned with masculine prompts than their feminine counterparts. We further explore where bias originates through representational disparities and how it manifests in the images via prompt-image dependencies, and provide recommendations for developers and users to mitigate potential bias in image generation.
- Inspecting the geographical representativeness of images from text-to-image models. In ICCV, 2023.
- A prompt array keeps the bias away: Debiasing vision-language models with adversarial learning. In AACL-IJNCLP, 2022.
- Improving image generation with better captions.
- Easily accessible text-to-image generation amplifies demographic stereotypes at large scale. In FAccT, 2023.
- Natural language processing with Python: analyzing text with the natural language toolkit. 2009.
- Multimodal datasets: misogyny, pornography, and malignant stereotypes. arXiv preprint arXiv:2110.01963, 2021.
- Into the LAION’s Den: Investigating hate in multimodal datasets. In NeurlPS Datasets and Benchmarks Track, 2023.
- G. Bradski. The OpenCV Library. Dr. Dobb’s Journal of Software Tools, 2000.
- Language models are few-shot learners. In NeurlPS, 2020.
- Semantics derived automatically from language corpora contain human-like biases. Science, 2017.
- Extracting training data from diffusion models. In USENIX Security 23, 2023.
- Emerging properties in self-supervised vision transformers. In ICCV, 2021.
- Dall-Eval: Probing the reasoning skills and social biases of text-to-image generation models. In ICCV, 2023.
- CogView: Mastering text-to-image generation via transformers. In NeurlPS, 2021.
- CogView2: Faster and better text-to-image generation via hierarchical transformers. In NeurlPS, 2022.
- An image is worth 16x16 words: Transformers for image recognition at scale. In ICLR, 2021.
- Diffusion self-guidance for controllable image generation. In NeurlPS, 2023.
- Erasing concepts from diffusion models. In ICCV, 2023a.
- Unified concept editing in diffusion models. arXiv preprint arXiv:2308.14761, 2023b.
- Uncurated image-text datasets: Shedding light on demographic bias. In CVPR, 2023.
- GenEval: An object-focused framework for evaluating text-to-image alignment. arXiv preprint arXiv:2310.11513, 2023.
- Generative adversarial networks. Communications of the ACM, 2020.
- DIG In: Evaluating disparities in image generations with indicators for geographic diversity. arXiv preprint arXiv:2308.06198, 2023.
- Deep residual learning for image recognition. In CVPR, 2016.
- Prompt-to-Prompt image editing with cross-attention control. In ICLR, 2023.
- CLIPScore: A reference-free evaluation metric for image captioning. In EMNLP, 2021.
- GANs trained by a two time-scale update rule converge to a local nash equilibrium. In NeurlPS, 2017.
- Gender and racial bias in visual question answering datasets. In FAccT, 2022.
- Denoising diffusion probabilistic models. In NeurlPS, 2020.
- FairFace: Face attribute dataset for balanced race, gender, and age for bias measurement and mitigation. In WACV, 2021.
- Bias-to-Text: Debiasing unknown visual biases through language interpretation. arXiv preprint arXiv:2301.11104, 2023.
- Segment anything. In ICCV, 2023.
- Holistic evaluation of text-to-image models. In NeurlPS Datasets and Benchmarks Track, 2023.
- BART: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. In ACL, 2020.
- BLIP-2: Bootstrapping language-image pre-training with frozen image encoders and large language models. In ICML, 2023.
- Word-level explanations for analyzing bias in text-to-image models. arXiv preprint arXiv:2306.05500, 2023.
- Microsoft COCO: Common objects in context. In ECCV, 2014.
- Grounding DINO: Marrying dino with grounded pre-training for open-set object detection. In arXiv preprint arXiv:2303.05499, 2023.
- TF-ICON: Diffusion-based training-free cross-domain image composition. In ICCV, 2023.
- Stable bias: Analyzing societal representations in diffusion models. In NeurlPS, 2023.
- Multimodal composite association score: Measuring gender bias in generative multimodal models. arXiv preprint arXiv:2304.13855, 2023.
- Harvey Mannering. Analysing gender bias in text-to-image models using object detection. arXiv preprint arXiv:2307.08025, 2023.
- ClipCap: Clip prefix for image captioning. arXiv preprint arXiv:2111.09734, 2021.
- Social biases through the text-to-image generation lens. In AIES, 2023.
- Toward verifiable and reproducible human evaluation for text-to-image generation. In CVPR, 2023.
- BLEU: a method for automatic evaluation of machine translation. In ACL, 2002.
- LD-ZNet: A latent diffusion approach for text-based image segmentation. In ICCV, 2023.
- Learning transferable visual models from natural language supervision. In ICML, 2021.
- Zero-shot text-to-image generation. In ICML, 2021.
- Hierarchical text-conditional image generation with CLIP latents. arXiv preprint arXiv:2204.06125, 2022.
- Generative adversarial text to image synthesis. In ICML, 2016.
- High-resolution image synthesis with latent diffusion models. In CVPR, 2022.
- U-Net: Convolutional networks for biomedical image segmentation. In MICCAI, 2015.
- Photorealistic text-to-image diffusion models with deep language understanding. In NeurlPS, 2022.
- Improved techniques for training gans. In NeurlPS, 2016.
- The bias amplification paradox in text-to-image generation. arXiv preprint arXiv:2308.00755, 2023.
- Conceptual Captions: A cleaned, hypernymed, image alt-text dataset for automatic image captioning. In ACL, 2018.
- TextCaps: a dataset for image captioning with reading comprehension. In ECCV, 2020.
- Diffusion art or digital forgery? investigating data replication in diffusion models. In CVPR, 2023.
- What the DAAM: Interpreting stable diffusion using cross attention. In ACL, 2023.
- DF-GAN: A simple and effective baseline for text-to-image synthesis. In CVPR, 2022.
- Stereotypes and smut: The (mis) representation of non-cisgender identities by text-to-image models. In ACL, 2023.
- CIDEr: Consensus-based image description evaluation. In CVPR, 2015.
- T2IAT: Measuring valence and stereotypical biases in text-to-image generation. In ACL, 2023a.
- Evaluating data attribution for text-to-image models. In ICCV, 2023b.
- Image quality assessment: from error visibility to structural similarity. IEEE TIP, 2004.
- Contrastive language-vision ai models pretrained on web-scraped multimodal data exhibit sexual objectification bias. In FAccT, 2023.
- Diffumask: Synthesizing images with pixel-level annotations for semantic segmentation using diffusion models. In ICCV, 2023.
- Attention as annotation: Generating images and pseudo-masks for weakly supervised semantic segmentation with diffusion. arXiv preprint arXiv:2309.01369, 2023.
- From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. In ACL, 2014.
- Scaling autoregressive models for content-rich text-to-image generation. TMLR, 2022.
- ITI-GEN: Inclusive text-to-image generation. In CVPR, 2023a.
- Recognize anything: A strong image tagging model. arXiv preprint arXiv:2306.03514, 2023b.
- Auditing gender presentation differences in text-to-image models. arXiv preprint arXiv:2302.03675, 2023c.
- Men also like shopping: Reducing gender bias amplification using corpus-level constraints. In EMNLP, 2017.
- Yankun Wu (4 papers)
- Yuta Nakashima (67 papers)
- Noa Garcia (33 papers)