On Search Strategies for Document-Level Neural Machine Translation

06/08/2023
by   Christian Herold, et al.
0

Compared to sentence-level systems, document-level neural machine translation (NMT) models produce a more consistent output across a document and are able to better resolve ambiguities within the input. There are many works on document-level NMT, mostly focusing on modifying the model architecture or training strategy to better accommodate the additional context-input. On the other hand, in most works, the question on how to perform search with the trained model is scarcely discussed, sometimes not mentioned at all. In this work, we aim to answer the question how to best utilize a context-aware translation model in decoding. We start with the most popular document-level NMT approach and compare different decoding schemes, some from the literature and others proposed by us. In the comparison, we are using both, standard automatic metrics, as well as specific linguistic phenomena on three standard document-level translation benchmarks. We find that most commonly used decoding strategies perform similar to each other and that higher quality context information has the potential to further improve the translation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/19/2020

Diving Deep into Context-Aware Neural Machine Translation

Context-aware neural machine translation (NMT) is a promising direction ...
research
04/20/2023

Exploring Paracrawl for Document-level Neural Machine Translation

Document-level neural machine translation (NMT) has outperformed sentenc...
research
09/16/2020

Document-level Neural Machine Translation with Document Embeddings

Standard neural machine translation (NMT) is on the assumption of docume...
research
01/26/2021

A Comparison of Approaches to Document-level Machine Translation

Document-level machine translation conditions on surrounding sentences t...
research
04/25/2023

Escaping the sentence-level paradigm in machine translation

It is well-known that document context is vital for resolving a range of...
research
09/07/2022

Adam Mickiewicz University at WMT 2022: NER-Assisted and Quality-Aware Neural Machine Translation

This paper presents Adam Mickiewicz University's (AMU) submissions to th...
research
06/08/2023

Improving Long Context Document-Level Machine Translation

Document-level context for neural machine translation (NMT) is crucial t...

Please sign up or login with your details

Forgot password? Click here to reset