Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Building Socio-culturally Inclusive Stereotype Resources with Community Engagement (2307.10514v1)

Published 20 Jul 2023 in cs.CL, cs.AI, and cs.HC

Abstract: With rapid development and deployment of generative LLMs in global settings, there is an urgent need to also scale our measurements of harm, not just in the number and types of harms covered, but also how well they account for local cultural contexts, including marginalized identities and the social biases experienced by them. Current evaluation paradigms are limited in their abilities to address this, as they are not representative of diverse, locally situated but global, socio-cultural perspectives. It is imperative that our evaluation resources are enhanced and calibrated by including people and experiences from different cultures and societies worldwide, in order to prevent gross underestimations or skews in measurements of harm. In this work, we demonstrate a socio-culturally aware expansion of evaluation resources in the Indian societal context, specifically for the harm of stereotyping. We devise a community engaged effort to build a resource which contains stereotypes for axes of disparity that are uniquely present in India. The resultant resource increases the number of stereotypes known for and in the Indian context by over 1000 stereotypes across many unique identities. We also demonstrate the utility and effectiveness of such expanded resources for evaluations of LLMs. CONTENT WARNING: This paper contains examples of stereotypes that may be offensive.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Sunipa Dev (28 papers)
  2. Jaya Goyal (1 paper)
  3. Dinesh Tewari (4 papers)
  4. Shachi Dave (12 papers)
  5. Vinodkumar Prabhakaran (48 papers)
Citations (14)