Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Grep-BiasIR: A Dataset for Investigating Gender Representation-Bias in Information Retrieval Results (2201.07754v3)

Published 19 Jan 2022 in cs.IR

Abstract: The provided contents by information retrieval (IR) systems can reflect the existing societal biases and stereotypes. Such biases in retrieval results can lead to further establishing and strengthening stereotypes in society and also in the systems. To facilitate the studies of gender bias in the retrieval results of IR systems, we introduce Gender Representation-Bias for Information Retrieval (Grep-BiasIR), a novel thoroughly-audited dataset consisting of 118 bias-sensitive neutral search queries. The set of queries covers a wide range of gender-related topics, for which a biased representation of genders in the search result can be considered as socially problematic. Each query is accompanied with one relevant and one non-relevant document, where the document is also provided in three variations of female, male, and neutral. The dataset is available at https://github.com/KlaraKrieg/GrepBiasIR.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Klara Krieg (2 papers)
  2. Emilia Parada-Cabaleiro (7 papers)
  3. Gertraud Medicus (1 paper)
  4. Oleg Lesota (10 papers)
  5. Markus Schedl (48 papers)
  6. Navid Rekabsaz (31 papers)
Citations (25)

Summary

We haven't generated a summary for this paper yet.

Github Logo Streamline Icon: https://streamlinehq.com