Repairing Pronouns in Translation with BERT-Based Post-Editing

03/23/2021
by Reid Pryzant, et al.

Pronouns are important determinants of a text's meaning but are difficult to translate. Pronoun choice can depend on entities described in previous sentences, and in some languages pronouns may be dropped when the referent is inferable from context. These issues can lead Neural Machine Translation (NMT) systems to make critical pronoun errors that impair intelligibility and even reinforce gender bias. We investigate the severity of this pronoun issue, showing that (1) in some domains, pronoun choice can account for more than half of an NMT system's errors, and (2) pronouns have a disproportionately large impact on perceived translation quality. We then investigate a possible solution: fine-tuning BERT on a pronoun prediction task using chunks of source-side sentences, then using the resulting classifier to repair the translations of an existing NMT model. We offer an initial case study of this approach for the Japanese-English language pair, observing that a small number of translations are significantly improved according to human evaluators.
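The repair step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `repair_pronoun`, the pronoun inventory, the confidence threshold, and the classifier interface are all assumptions made for illustration. The classifier is represented as any callable that maps source-side context sentences to a probability per English pronoun; in the paper's setting that role would be played by a fine-tuned BERT model, which is replaced here by a toy stand-in so the example runs on its own.

```python
import re
from typing import Callable, Dict, List

# Hypothetical closed set of English subject pronouns (an assumption,
# not the label set used in the paper).
PRONOUNS = ["i", "you", "he", "she", "it", "we", "they"]
PRONOUN_RE = re.compile(r"\b(" + "|".join(PRONOUNS) + r")\b", re.IGNORECASE)

# Assumed classifier interface: source-side context -> pronoun probabilities.
Classifier = Callable[[List[str]], Dict[str, float]]


def repair_pronoun(translation: str,
                   source_context: List[str],
                   classify: Classifier,
                   threshold: float = 0.9) -> str:
    """Swap the first pronoun in `translation` when the classifier
    confidently predicts a different one; otherwise return it unchanged."""
    probs = classify(source_context)
    best = max(probs, key=probs.get)
    if probs[best] < threshold:
        return translation  # not confident enough to edit

    match = PRONOUN_RE.search(translation)
    if match is None or match.group(0).lower() == best:
        return translation  # nothing to repair

    # Preserve capitalization of the replaced token ("I" is always capital).
    if best == "i" or match.group(0)[0].isupper():
        replacement = best.capitalize()
    else:
        replacement = best
    return translation[:match.start()] + replacement + translation[match.end():]


def toy_classifier(source_context: List[str]) -> Dict[str, float]:
    """Stand-in for a fine-tuned classifier; returns fixed probabilities."""
    return {"she": 0.95, "he": 0.05}


if __name__ == "__main__":
    print(repair_pronoun("He went to the store.", ["(source sentences)"], toy_classifier))
```

The sketch only swaps the pronoun token. It does not adjust verb agreement or handle object and possessive forms, which a fuller post-editing system would need to address.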


research
10/17/2016

Pre-Translation for Neural Machine Translation

Recently, the development of neural machine translation (NMT) has signif...
research
04/09/2019

Text Repair Model for Neural Machine Translation

In this work, we train a text repair model as a post-processor for Neura...
research
02/21/2022

Domain Adaptation in Neural Machine Translation using a Qualia-Enriched FrameNet

In this paper we present Scylla, a methodology for domain adaptation of ...
research
09/30/2020

Can Automatic Post-Editing Improve NMT?

Automatic post-editing (APE) aims to improve machine translations, there...
research
12/24/2020

Why Neural Machine Translation Prefers Empty Outputs

We investigate why neural machine translation (NMT) systems assign high ...
research
02/17/2020

Incorporating BERT into Neural Machine Translation

The recently proposed BERT has shown great power on a variety of natural...
research
10/26/2020

Is it Great or Terrible? Preserving Sentiment in Neural Machine Translation of Arabic Reviews

Since the advent of Neural Machine Translation (NMT) approaches there ha...
