Gender in Danger? Evaluating Speech Translation Technology on the MuST-SHE Corpus

06/10/2020
by Luisa Bentivogli, et al.

Translating from languages without productive grammatical gender, such as English, into gender-marked languages is a well-known difficulty for machines. The difficulty stems in part from the training data on which models are built, which typically reflect the asymmetries of natural languages, gender bias included. Fed exclusively with textual data, machine translation is intrinsically constrained by the fact that the input sentence does not always contain clues about the gender identity of the human entities referred to. But what happens with speech translation, where the input is an audio signal? Can audio provide additional information to reduce gender bias? We present the first thorough investigation of gender bias in speech translation, with two contributions: i) the release of a benchmark useful for future studies, and ii) a comparison of different technologies (cascade and end-to-end) on two language directions (English-Italian and English-French).

