Improving Robustness of Retrieval Augmented Translation via Shuffling of Suggestions

10/11/2022
by Cuong Hoang, et al.
Several recent studies have reported dramatic performance improvements in neural machine translation (NMT) from augmenting translation at inference time with fuzzy matches retrieved from a translation memory (TM). However, these studies all assume that the TMs available at test time are highly relevant to the test set. We demonstrate that, for existing retrieval-augmented translation methods, using a TM with a domain mismatch to the test set can result in substantially worse performance than using no TM at all. We propose a simple method that exposes fuzzy-match NMT systems to shuffled suggestions during training, and show that the resulting system is much more tolerant of inference with domain-mismatched TMs, regaining up to 5.8 BLEU. The model also remains competitive with the baseline when fed suggestions from relevant TMs.
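
The abstract does not spell out how suggestions are shuffled, so the following is only a minimal sketch of one plausible reading: during training, each example's retrieved fuzzy matches are sometimes swapped for a set drawn at random from the examples, so the model also sees suggestions that do not fit its source sentence. The function name `shuffle_suggestions`, the `Example` tuple layout, and the `shuffle_prob` parameter are all hypothetical and are not taken from the paper.

```python
import random
from typing import List, Sequence, Tuple

# Hypothetical layout: (source sentence, target sentence, retrieved fuzzy matches)
Example = Tuple[str, str, List[str]]


def shuffle_suggestions(
    examples: Sequence[Example],
    shuffle_prob: float = 0.5,  # assumed hyperparameter, not from the paper
    seed: int = 0,
) -> List[Example]:
    """Return a copy of `examples` in which some suggestion lists are
    replaced by a list drawn at random from the pool of all examples."""
    rng = random.Random(seed)
    # Pool of every suggestion list in the data; a draw may return the
    # example's own list, so not every swap yields a mismatch.
    pool = [suggestions for _, _, suggestions in examples]
    shuffled = []
    for source, target, suggestions in examples:
        if len(pool) > 1 and rng.random() < shuffle_prob:
            suggestions = rng.choice(pool)
        # Source and target are never changed, only the suggestions.
        shuffled.append((source, target, list(suggestions)))
    return shuffled
```

With `shuffle_prob=0.0` the data passes through unchanged, which corresponds to ordinary fuzzy-match training. Larger values expose the model to more unrelated suggestions, and the right trade-off would have to be tuned empirically.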
