Breeding Gender-aware Direct Speech Translation Systems

12/09/2020
by Marco Gaido, et al.

In automatic speech translation (ST), traditional cascade approaches involving separate transcription and translation steps are giving ground to increasingly competitive and more robust direct solutions. In particular, by translating speech audio data without intermediate transcription, direct ST models are able to leverage and preserve essential information present in the input (e.g., the speaker's vocal characteristics) that is otherwise lost in the cascade framework. Although this ability has proved useful for gender translation, direct ST is nonetheless affected by gender bias, just like its cascade counterpart, machine translation, and numerous other natural language processing applications. Moreover, direct ST systems that rely exclusively on vocal biometric features as a gender cue can be unsuitable and potentially harmful for certain users. Going beyond speech signals, in this paper we compare different approaches to informing direct ST models about the speaker's gender and test their ability to handle gender translation from English into Italian and French. To this end, we manually annotated large datasets with speakers' gender information and used them for experiments reflecting different possible real-world scenarios. Our results show that gender-aware direct ST solutions can significantly outperform strong but gender-unaware direct ST models. In particular, accuracy on the translation of gender-marked words can increase by up to 30 points while overall translation quality is preserved.
