Monotonic Simultaneous Translation with Chunk-wise Reordering and Refinement

10/18/2021
by   Hyojung Han, et al.

Recent simultaneous machine translation models are often trained on conventional full-sentence translation corpora. For language pairs whose word orders differ significantly, this leads to either excessive latency or the need to anticipate words that have not yet arrived. Human simultaneous interpreters, by contrast, produce largely monotonic translations at the expense of the grammaticality of the sentence being translated. In this paper, we therefore propose an algorithm that reorders and refines the target side of a full-sentence translation corpus so that the words and phrases of the source and target sentences are aligned largely monotonically, using word alignment and non-autoregressive neural machine translation. We then train a widely used wait-k simultaneous translation model on this reordered-and-refined corpus. The proposed approach improves BLEU scores, and the resulting translations exhibit enhanced monotonicity with respect to the source sentences.
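The ideas in the abstract can be illustrated with a small sketch. It is not the paper's algorithm: the function names, the token-level (rather than chunk-wise) reordering, and the rule for unaligned tokens are all assumptions made for illustration, and the refinement step with non-autoregressive translation is omitted entirely. Only the wait-k schedule follows the standard definition, in which the model reads k source tokens before writing the first target token and then alternates one read per write.

```python
from typing import Dict, List, Tuple

# An alignment link is (source_index, target_index), both 0-based.
Link = Tuple[int, int]


def waitk_schedule(src_len: int, tgt_len: int, k: int) -> List[int]:
    """Source tokens read before each target token is written under wait-k."""
    return [min(k + t, src_len) for t in range(tgt_len)]


def count_crossings(alignment: List[Link]) -> int:
    """Count pairs of links that cross; 0 means a fully monotonic alignment."""
    crossings = 0
    for a in range(len(alignment)):
        for b in range(a + 1, len(alignment)):
            (s1, t1), (s2, t2) = alignment[a], alignment[b]
            if (s1 - s2) * (t1 - t2) < 0:
                crossings += 1
    return crossings


def reorder_target(target: List[str], alignment: List[Link]) -> List[str]:
    """Stable-sort target tokens by the earliest source index they align to.

    Hypothetical rule: an unaligned token inherits the key of the token
    before it, so it stays attached to its left neighbour.
    """
    earliest: Dict[int, int] = {}
    for s, t in alignment:
        earliest[t] = min(s, earliest.get(t, s))
    keys: List[int] = []
    prev = 0
    for t in range(len(target)):
        prev = earliest.get(t, prev)
        keys.append(prev)
    order = sorted(range(len(target)), key=lambda t: keys[t])
    return [target[t] for t in order]


# Toy verb-final source ("I apple ate") against an English reference.
target = ["I", "ate", "an", "apple"]
alignment = [(0, 0), (1, 3), (2, 1)]  # hand-written links, not aligner output

reordered = reorder_target(target, alignment)
schedule = waitk_schedule(src_len=3, tgt_len=4, k=2)
print(reordered, count_crossings(alignment), schedule)
```

In this toy case the reordered target follows the source order but is no longer grammatical English, which is the trade-off the abstract describes and the reason a refinement step is needed after reordering.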

research
11/03/2019

Machine Translation in Pronunciation Space

The research in machine translation community focus on translation in te...
research
11/21/2019

Minimizing the Bag-of-Ngrams Difference for Non-Autoregressive Neural Machine Translation

Non-Autoregressive Neural Machine Translation (NAT) achieves significant...
research
10/21/2020

Improving Simultaneous Translation with Pseudo References

Simultaneous translation is vastly different from full-sentence translat...
research
10/08/2022

Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation

Non-autoregressive translation (NAT) models are typically trained with t...
research
08/05/2016

Winograd Schemas and Machine Translation

A Winograd schema is a pair of sentences that differ in a single word an...
research
05/24/2022

Lack of Fluency is Hurting Your Translation Model

Many machine translation models are trained on bilingual corpus, which c...
research
06/03/2019

From Words to Sentences: A Progressive Learning Approach for Zero-resource Machine Translation with Visual Pivots

The neural machine translation model has suffered from the lack of large...
