Biased Embeddings from Wild Data: Measuring, Understanding and Removing

06/16/2018
by Adam Sutton, et al.

Many modern Artificial Intelligence (AI) systems make use of data embeddings, particularly in the domain of Natural Language Processing (NLP). These embeddings are learnt from data gathered "from the wild" and have been found to contain unwanted biases. In this paper we make three contributions towards measuring, understanding and removing these biases. We present a rigorous way to measure some of them, based on word lists created for social psychology applications; we observe that the gender bias attached to occupations in the embeddings reflects the actual gender bias in the same occupations in the real world; and finally we demonstrate that a simple projection can significantly reduce the effects of embedding bias. All this is part of an ongoing effort to understand how trust can be built into AI systems.
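
The abstract does not say how the bias is measured or which projection is used, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a bias direction defined as the difference between two word vectors ("he" minus "she"), measures association as a difference of cosine similarities, and removes the component of a word vector along that direction. The three-dimensional vectors are made-up toy values, and the helper names (`cosine`, `remove_direction`) are hypothetical.

```python
import math

def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def normalize(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(dot(v, v))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [a / norm for a in v]

def cosine(u, v):
    """Cosine similarity between two vectors."""
    return dot(normalize(u), normalize(v))

def remove_direction(v, direction):
    """Subtract from v its component along `direction` (orthogonal projection)."""
    d = normalize(direction)
    scale = dot(v, d)
    return [a - scale * b for a, b in zip(v, d)]

# Toy embeddings (assumed values, for illustration only).
he = [1.0, 0.2, 0.1]
she = [-1.0, 0.2, 0.1]
nurse = [-0.6, 0.7, 0.3]

# Assumed bias direction: the difference between the two gendered words.
bias_direction = [a - b for a, b in zip(he, she)]

# Association score: positive leans towards "he", negative towards "she".
before = cosine(nurse, he) - cosine(nurse, she)

debiased_nurse = remove_direction(nurse, bias_direction)
after = cosine(debiased_nurse, he) - cosine(debiased_nurse, she)

print(f"association before projection: {before:.3f}")
print(f"association after projection:  {after:.3f}")
```

In this toy setup the projected vector has no component left along the assumed bias direction, so its association with the two gendered words is equal. With real embeddings a direction would more plausibly be estimated from many word pairs, and some residual association is to be expected.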

