Welcome to the Modern World of Pronouns: Identity-Inclusive Natural Language Processing beyond Gender

02/24/2022
by Anne Lauscher, et al.

The world of pronouns is changing: from a closed class of words with few members to a much more open set of terms that reflect identities. However, Natural Language Processing (NLP) barely reflects this linguistic shift, even though recent work has outlined the harms of gender-exclusive language technology. Particularly problematic is the current modeling of 3rd person pronouns, which largely ignores phenomena such as neopronouns, i.e., pronoun sets that are novel and not (yet) widely established. This omission contributes to discrimination against marginalized and underrepresented groups, e.g., non-binary individuals. Moreover, current NLP technology also ignores other identity-expression phenomena beyond gender. In this paper, we provide an overview of 3rd person pronoun issues for NLP. Based on our observations and ethical considerations, we define a series of desiderata for modeling pronouns in language technology. We qualitatively evaluate existing and novel modeling approaches w.r.t. these desiderata, and quantify the impact of a more discrimination-free approach on established benchmark data.

research
10/01/2021

Unpacking the Interdependent Systems of Discrimination: Ableist Bias in NLP Systems through an Intersectional Lens

Much of the world's population experiences some form of disability durin...
research
07/31/2018

Gender Bias in Neural Natural Language Processing

We examine whether neural natural language processing (NLP) systems refl...
research
04/19/2023

Radar de Parité: An NLP system to measure gender representation in French news stories

We present the Radar de Parité, an automated Natural Language Processing...
research
02/12/2021

They, Them, Theirs: Rewriting with Gender-Neutral English

Responsible development of technology involves applications being inclus...
research
05/17/2023

"I'm fully who I am": Towards Centering Transgender and Non-Binary Voices to Measure Biases in Open Language Generation

Transgender and non-binary (TGNB) individuals disproportionately experie...
research
02/13/2023

Linguistic ambiguity analysis in ChatGPT

Linguistic ambiguity is and has always been one of the main challenges i...
research
06/14/2019

Principled Frameworks for Evaluating Ethics in NLP Systems

We critique recent work on ethics in natural language processing. Those ...