MT-GenEval: A Counterfactual and Contextual Dataset for Evaluating Gender Accuracy in Machine Translation

11/02/2022
by   Anna Currey, et al.
0

As generic machine translation (MT) quality has improved, the need for targeted benchmarks that explore fine-grained aspects of quality has increased. In particular, gender accuracy in translation can have implications in terms of output fluency, translation accuracy, and ethics. In this paper, we introduce MT-GenEval, a benchmark for evaluating gender accuracy in translation from English into eight widely-spoken languages. MT-GenEval complements existing benchmarks by providing realistic, gender-balanced, counterfactual data in eight language pairs where the gender of individuals is unambiguous in the input segment, including multi-sentence segments requiring inter-sentential gender agreement. Our data and code is publicly available under a CC BY SA 3.0 license.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/03/2019

Evaluating Gender Bias in Machine Translation

We present the first challenge set and evaluation protocol for the analy...
research
05/09/2022

CoCoA-MT: A Dataset and Benchmark for Contrastive Controlled MT with Application to Formality

The machine translation (MT) task is typically formulated as that of ret...
research
06/07/2023

Gender, names and other mysteries: Towards the ambiguous for gender-inclusive translation

The vast majority of work on gender in MT focuses on 'unambiguous' input...
research
04/15/2021

Improving Gender Translation Accuracy with Filtered Self-Training

Targeted evaluations have found that machine translation systems often o...
research
06/30/2016

HUME: Human UCCA-Based Evaluation of Machine Translation

Human evaluation of machine translation normally uses sentence-level mea...
research
05/25/2023

What about em? How Commercial Machine Translation Fails to Handle (Neo-)Pronouns

As 3rd-person pronoun usage shifts to include novel forms, e.g., neopron...
research
04/29/2020

Automatically Identifying Gender Issues in Machine Translation using Perturbations

The successful application of neural methods to machine translation has ...

Please sign up or login with your details

Forgot password? Click here to reset