Improving Diversity of Demographic Representation in Large Language Models via Collective-Critiques and Self-Voting (2310.16523v1)

Published 25 Oct 2023 in cs.CL and cs.AI

Abstract: A crucial challenge for generative LLMs is diversity: when a user's prompt is under-specified, models may follow implicit assumptions while generating a response, which may result in homogenization of the responses, as well as certain demographic groups being under-represented or even erased from the generated responses. In this paper, we formalize diversity of representation in generative LLMs. We present evaluation datasets and propose metrics to measure diversity in generated responses along people and culture axes. We find that LLMs understand the notion of diversity, and that they can reason about and critique their own responses for that goal. This finding motivated a new prompting technique called collective-critique and self-voting (CCSV) to self-improve the people diversity of LLMs by tapping into their diversity reasoning capabilities, without relying on handcrafted examples or prompt tuning. Extensive empirical experiments with both human and automated evaluations show that our proposed approach is effective at improving people and culture diversity, and outperforms all baseline methods by a large margin.

Authors (11)
  1. Preethi Lahoti (13 papers)
  2. Nicholas Blumm (2 papers)
  3. Xiao Ma (169 papers)
  4. Raghavendra Kotikalapudi (1 paper)
  5. Sahitya Potluri (3 papers)
  6. Qijun Tan (11 papers)
  7. Hansa Srinivasan (6 papers)
  8. Ben Packer (11 papers)
  9. Ahmad Beirami (86 papers)
  10. Alex Beutel (52 papers)
  11. Jilin Chen (32 papers)
Citations (22)