Collecting a Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation

09/08/2021
by Shahar Levy, et al.

Recent work has found evidence of gender bias in machine translation and coreference resolution models, mostly using synthetic diagnostic datasets. While these datasets quantify bias in a controlled setting, they are often small and consist mostly of artificial, out-of-distribution sentences. In this work, we find grammatical patterns indicating stereotypical and non-stereotypical gender-role assignments (e.g., female nurses versus male dancers) in corpora from three domains, resulting in the first large-scale gender bias dataset of 108K diverse real-world English sentences. We manually verify the quality of our corpus and use it to evaluate gender bias in various coreference resolution and machine translation models. We find that all tested models tend to over-rely on gender stereotypes when presented with natural inputs, which may be especially harmful when deployed in commercial systems. Finally, we show that our dataset lends itself to finetuning a coreference resolution model, and that doing so mitigates bias on a held-out set. Our dataset and models are publicly available at www.github.com/SLAB-NLP/BUG. We hope they will spur future research into gender bias evaluation and mitigation techniques in realistic settings.
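One common way to express over-reliance on stereotypes is to compare a model's accuracy on stereotypical examples with its accuracy on non-stereotypical ones. The sketch below illustrates that comparison only; it is not the authors' evaluation code. The group labels, the function names, and the toy records are hypothetical and do not come from the BUG dataset.

```python
from collections import defaultdict


def accuracy_by_group(examples):
    """Return accuracy per group for (group, gold, predicted) triples."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for group, gold, predicted in examples:
        total[group] += 1
        correct[group] += int(gold == predicted)
    return {group: correct[group] / total[group] for group in total}


def stereotype_gap(examples):
    """Accuracy on stereotypical minus accuracy on anti-stereotypical examples.

    A positive gap means the model does better when the gold gender
    matches the stereotype for the role (assumed labelling scheme).
    """
    acc = accuracy_by_group(examples)
    return acc["stereotypical"] - acc["anti-stereotypical"]


# Hypothetical toy predictions: (group, gold gender, predicted gender).
toy = [
    ("stereotypical", "female", "female"),
    ("stereotypical", "male", "male"),
    ("anti-stereotypical", "female", "male"),
    ("anti-stereotypical", "male", "male"),
]

gap = stereotype_gap(toy)
print(f"stereotype gap: {gap:.2f}")
```

A gap near zero would suggest the model treats both groups alike, while a large positive gap would suggest it leans on the stereotype.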

