Assessing the Reliability of Word Embedding Gender Bias Measures

09/10/2021
by   Yupei Du, et al.
20

Various measures have been proposed to quantify human-like social biases in word embeddings. However, bias scores based on these measures can suffer from measurement error. One indication of measurement quality is reliability, concerning the extent to which a measure produces consistent results. In this paper, we assess three types of reliability of word embedding gender bias measures, namely test-retest reliability, inter-rater consistency and internal consistency. Specifically, we investigate the consistency of bias scores across different choices of random seeds, scoring rules and words. Furthermore, we analyse the effects of various factors on these measures' reliability scores. Our findings inform better design of word embedding gender bias measures. Moreover, we urge researchers to be more critical about the application of such measures.

READ FULL TEXT

page 16

page 19

research
04/18/2019

Evaluating the Underlying Gender Bias in Contextualized Word Embeddings

Gender bias is highly impacting natural language processing applications...
research
10/06/2020

Robustness and Reliability of Gender Bias Assessment in WordEmbeddings: The Role of Base Pairs

It has been shown that word embeddings can exhibit gender bias, and vari...
research
10/11/2022

Social-Group-Agnostic Word Embedding Debiasing via the Stereotype Content Model

Existing word embedding debiasing methods require social-group-specific ...
research
08/21/2018

Downsampling Strategies are Crucial for Word Embedding Reliability

The reliability of word embeddings algorithms, i.e., their ability to pr...
research
04/17/2020

Wide range screening of algorithmic bias in word embedding models using large sentiment lexicons reveals underreported bias types

Concerns about gender bias in word embedding models have captured substa...
research
01/28/2023

Comparing Intrinsic Gender Bias Evaluation Measures without using Human Annotated Examples

Numerous types of social biases have been identified in pre-trained lang...
research
03/28/2022

The SAME score: Improved cosine based bias score for word embeddings

Over the last years, word and sentence embeddings have established as te...

Please sign up or login with your details

Forgot password? Click here to reset