FAIntbench: A Holistic and Precise Benchmark for Bias Evaluation in Text-to-Image Models (2405.17814v5)

Published 28 May 2024 in cs.CV and cs.AI

Abstract: The rapid development of and reduced barriers to entry for Text-to-Image (T2I) models have raised concerns about biases in their outputs, but existing research lacks a holistic definition and evaluation framework for these biases, limiting the improvement of debiasing techniques. To address this issue, we introduce FAIntbench, a holistic and precise benchmark for biases in T2I models. In contrast to existing benchmarks that evaluate bias in limited aspects, FAIntbench evaluates biases along four dimensions: manifestation of bias, visibility of bias, acquired attributes, and protected attributes. We applied FAIntbench to seven recent large-scale T2I models and conducted a human evaluation; the results demonstrate the effectiveness of FAIntbench in identifying various biases. Our study also revealed new research questions about biases, including the side effects of distillation. The findings presented here are preliminary, highlighting the potential of FAIntbench to advance future research aimed at mitigating biases in T2I models. Our benchmark is publicly available to ensure reproducibility.

Authors (4)
  1. Hanjun Luo (8 papers)
  2. Ziye Deng (3 papers)
  3. Ruizhe Chen (32 papers)
  4. Zuozhu Liu (78 papers)
Citations (5)