Measuring Normative and Descriptive Biases in Language Models Using Census Data

04/12/2023
by   Samia Touileb, et al.
0

We investigate in this paper how distributions of occupations with respect to gender is reflected in pre-trained language models. Such distributions are not always aligned to normative ideals, nor do they necessarily reflect a descriptive assessment of reality. In this paper, we introduce an approach for measuring to what degree pre-trained language models are aligned to normative and descriptive occupational distributions. To this end, we use official demographic information about gender–occupation distributions provided by the national statistics agencies of France, Norway, United Kingdom, and the United States. We manually generate template-based sentences combining gendered pronouns and nouns with occupations, and subsequently probe a selection of ten language models covering the English, French, and Norwegian languages. The scoring system we introduce in this work is language independent, and can be used on any combination of template-based sentences, occupations, and languages. The approach could also be extended to other dimensions of national census data and other demographic variables.

READ FULL TEXT
research
04/12/2023

Measuring Gender Bias in West Slavic Language Models

Pre-trained language models have been known to perpetuate biases from th...
research
11/21/2022

Measuring Harmful Representations in Scandinavian Language Models

Scandinavian countries are perceived as role-models when it comes to gen...
research
05/12/2022

Using Natural Sentences for Understanding Biases in Language Models

Evaluation of biases in language models is often limited to syntheticall...
research
12/10/2020

As good as new. How to successfully recycle English GPT-2 to make models for other languages

Large generative language models have been very successful for English, ...
research
03/26/2022

Metaphors in Pre-Trained Language Models: Probing and Generalization Across Datasets and Languages

Human languages are full of metaphorical expressions. Metaphors help peo...
research
12/05/2022

INCLUSIFY: A benchmark and a model for gender-inclusive German

Gender-inclusive language is important for achieving gender equality in ...
research
07/04/2023

The Inner Sentiments of a Thought

Transformer-based large-scale language models (LLMs) are able to generat...

Please sign up or login with your details

Forgot password? Click here to reset