T-HITL Effectively Addresses Problematic Associations in Image Generation and Maintains Overall Visual Quality (2402.17101v1)
Abstract: Generative AI image models may inadvertently generate problematic representations of people. Past research has noted that millions of users engage daily across the world with these models and that the models, including through problematic representations of people, have the potential to compound and accelerate real-world discrimination and other harms (Bianchi et al, 2023). In this paper, we focus on addressing the generation of problematic associations between demographic groups and semantic concepts that may reflect and reinforce negative narratives embedded in social data. Building on sociological literature (Blumer, 1958) and mapping representations to model behaviors, we have developed a taxonomy to study problematic associations in image generation models. We explore the effectiveness of fine tuning at the model level as a method to address these associations, identifying a potential reduction in visual quality as a limitation of traditional fine tuning. We also propose a new methodology with twice-human-in-the-loop (T-HITL) that promises improvements in both reducing problematic associations and also maintaining visual quality. We demonstrate the effectiveness of T-HITL by providing evidence of three problematic associations addressed by T-HITL at the model level. Our contributions to scholarship are two-fold. By defining problematic associations in the context of machine learning models and generative AI, we introduce a conceptual and technical taxonomy for addressing some of these associations. Finally, we provide a method, T-HITL, that addresses these associations and simultaneously maintains visual quality of image model generations. This mitigation need not be a tradeoff, but rather an enhancement.
- Anthropic. Model Card and Evaluations for Claude Models. 2023. https://www-files.anthropic.com/production/images/Model-Card-Claude-2.pdf.
- Break-a-scene: Extracting multiple concepts from a single image. In SIGGRAPH Asia 2023 Conference Papers, SA ’23. ACM, December 2023. 10.1145/3610548.3618154. http://dx.doi.org/10.1145/3610548.3618154.
- Recent advances in adversarial training for adversarial robustness. 2021.
- Easily accessible text-to-image generation amplifies demographic stereotypes at large scale. In Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, FAccT ’23, page 1493–1504, New York, NY, USA, 2023. Association for Computing Machinery. ISBN 9798400701924. 10.1145/3593013.3594095. https://doi.org/10.1145/3593013.3594095.
- Herbert Blumer. Race prejudice as a sense of group position. The Pacific Sociological Review, 1(1):3–7, 1958. ISSN 00308919. http://www.jstor.org/stable/1388607.
- Leo Chavez. The Latino Threat: Constructing Immigrants, Citizens, and the Nation, Second Edition. Stanford University Press, 2013. ISBN 9780804786188. https://books.google.com/books?id=-CTlKu6In3cC.
- Photoverse: Tuning-free image customization with text-to-image diffusion models. 2023.
- Emu: Enhancing image generation models using photogenic needles in a haystack. Meta, 2023.
- Philip J. Deloria. Playing Indian. Yale historical publications. Yale University Press, 1998. ISBN 9780300071115. https://books.google.com/books?id=4D5nEAAAQBAJ.
- Sander Gilman. Smart Jews: The Construction of the Image of Jewish Superior Intelligence. Abraham Lincoln lecture series. University of Nebraska Press, 1996. ISBN 9780803221581.
- Nicholas D. Hartlep. The Model Minority Stereotype: Demystifying Asian American Success. Information Age Publishing, 2013.
- Jeffrey Herf. The “Jewish War”: Goebbels and the Antisemitic Campaigns of the Nazi Propaganda Ministry. Holocaust and Genocide Studies, 19(1):51–80, 03 2005. ISSN 8756-6583. 10.1093/hgs/dci003. https://doi.org/10.1093/hgs/dci003.
- Simianization: Apes, Gender, Class, and Race. Lit Verlag, 2016.
- Alison M. Jaggar and Susan Bordo, editors. Gender/Body/Knowledge: Feminist Reconstructions of Being and Knowing. Rutgers University Press, New Brunswick, N.J., 1989.
- Theorizing representing the other. In Sue Wilkinson and Celia Kitzinger, editors, Representing the Other: A Feminism & Psychology Reader, pages 1–32. Sage Publications, 1996.
- Unresponsive wakefulness syndrome: a new name for the vegetative state or apallic syndrome. BMC Medicine, 8(68), 2010. 10.1186/1741-7015-8-68.
- Irene López Rodríguez. Of women, bitches, chickens and vixens: Animal metaphors for women in english and spanish. Culture, language and representation, vii:77–100, 2009.
- Framing Muslims. Harvard University Press, 2011. http://www.jstor.org/stable/j.ctt24hkf7.
- OpenAI. GPT-4V(ision) System Card. 2023. https://cdn.openai.com/papers/GPTV_System_Card.pdf.
- Lincoln Quillian. New approaches to understanding racial prejudice and discrimination. Annual Review of Sociology, 32(1):299–328, 2006. 10.1146/annurev.soc.32.061604.123132. https://doi.org/10.1146/annurev.soc.32.061604.123132.
- Dreambooth: Fine tuning text-to-image diffusion models for subject-driven generation, 2023.
- Ritch C. Savin-Williams. –and Then I Became Gay: Young Men’s Stories. Routledge, 1998.
- David Livingstone Smith. Less Than Human: Why We Demean, Enslave, and Exterminate Others. St. Martin’s Publishing Group, 2011. ISBN 9781429968560. https://books.google.com/books?id=rWAiRrm3LkcC.
- Dehumanizing representations of women: the shaping of hostile sexist attitudes through animalistic metaphors*. Journal of Gender Studies, 28(1):109–118, 2019. 10.1080/09589236.2017.1411790. https://doi.org/10.1080/09589236.2017.1411790.
- Demonizing the Other: Antisemitism, Racism and Xenophobia. Studies in antisemitism. Harwood Academic, 1999. ISBN 9789057024979. https://books.google.com/books?id=qSwdfHhscmgC.
- Susan Epstein (1 paper)
- Li Chen (590 papers)
- Alessandro Vecchiato (1 paper)
- Ankit Jain (22 papers)