Grep-BiasIR: A Dataset for Investigating Gender Representation-Bias in Information Retrieval Results

01/19/2022
by   Klara Krieg, et al.
0

The provided contents by information retrieval (IR) systems can reflect the existing societal biases and stereotypes. Such biases in retrieval results can lead to further establishing and strengthening stereotypes in society and also in the systems. To facilitate the studies of gender bias in the retrieval results of IR systems, we introduce Gender Representation-Bias for Information Retrieval (Grep-BiasIR), a novel thoroughly-audited dataset consisting of 118 bias-sensitive neutral search queries. The set of queries covers a wide range of gender-related topics, for which a biased representation of genders in the search result can be considered as socially problematic. Each query is accompanied with one relevant and one non-relevant documents, where the document is also provided in three variations of female, male, and neutral. The dataset is available at https://github.com/KlaraKrieg/GrepBiasIR.

READ FULL TEXT

page 1

page 2

page 3

research
03/03/2022

Do Perceived Gender Biases in Retrieval Results Affect Relevance Judgements?

This work investigates the effect of gender-stereotypical biases in the ...
research
05/01/2020

Do Neural Ranking Models Intensify Gender Bias?

Concerns regarding the footprint of societal biases in information retri...
research
09/13/2019

Recommendation or Discrimination?: Quantifying Distribution Parity in Information Retrieval Systems

Information retrieval (IR) systems often leverage query data to suggest ...
research
04/16/2020

ViBE: A Tool for Measuring and Mitigating Bias in Image Datasets

Machine learning models are known to perpetuate the biases present in th...
research
08/02/2022

Debiasing Gender Bias in Information Retrieval Models

Biases in culture, gender, ethnicity, etc. have existed for decades and ...
research
06/26/2021

Detecting race and gender bias in visual representation of AI on web search engines

Web search engines influence perception of social reality by filtering a...
research
11/25/2021

Evaluating the Robustness of Retrieval Pipelines with Query Variation Generators

Heavily pre-trained transformers for language modelling, such as BERT, h...

Please sign up or login with your details

Forgot password? Click here to reset