Improving a Multi-Source Neural Machine Translation Model with Corpus Extension for Low-Resource Languages

09/26/2017
by Gyu-Hyeon Choi, et al.

Improving machine translation usually means collecting more training resources, but most language pairs do not have enough resources to train a machine translation system. In this paper, we propose extending a low-resource corpus with synthetic methods and applying the extended corpus to a multi-source neural machine translation model. We show that corpus extension with synthetic methods improves machine translation performance. In particular, we focus on how to create source sentences that yield better target sentences, even when those source sentences are produced synthetically. We also find that corpus extension improves the performance of a multi-source neural machine translation model. Both corpus extension and the multi-source model prove to be efficient methods for a low-resource language pair, and using the two together yields better machine translation performance.
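
The abstract does not spell out the synthetic method, so the following is only a minimal sketch of one way corpus extension for a multi-source model might look. It assumes that a second source sentence is produced by machine-translating the existing source side, which turns each bilingual pair into a (source, synthetic source, target) triple. The names `extend_corpus` and `toy_translate` are hypothetical, and `toy_translate` is a stand-in for a real translation system, not the authors' method.

```python
from typing import Callable, List, Tuple


def extend_corpus(
    pairs: List[Tuple[str, str]],
    translate: Callable[[str], str],
) -> List[Tuple[str, str, str]]:
    """Turn (source, target) pairs into (source, synthetic_source, target) triples.

    `translate` is an assumed callable that maps a source sentence to a
    sentence in a second source language (for example, an existing MT system).
    """
    triples = []
    for source, target in pairs:
        synthetic_source = translate(source)  # synthetic second source sentence
        triples.append((source, synthetic_source, target))
    return triples


def toy_translate(sentence: str) -> str:
    # Placeholder only: reverses word order so the output is visibly different.
    # A real setup would call a trained translation model here.
    return " ".join(reversed(sentence.split()))


corpus = [("a b c", "x y z"), ("d e", "u v")]
extended = extend_corpus(corpus, toy_translate)
```

Each resulting triple could then be fed to a multi-source model that reads both source sentences and predicts the shared target.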

research
10/01/2019

Application of Low-resource Machine Translation Techniques to Russian-Tatar Language Pair

Neural machine translation is the current state-of-the-art in machine tr...
research
03/20/2021

The Effectiveness of Morphology-aware Segmentation in Low-Resource Neural Machine Translation

This paper evaluates the performance of several modern subword segmentat...
research
05/22/2020

Simplify-then-Translate: Automatic Preprocessing for Black-Box Machine Translation

Black-box machine translation systems have proven incredibly useful for ...
research
04/09/2020

Self-Training for Unsupervised Neural Machine Translation in Unbalanced Training Data Scenarios

Unsupervised neural machine translation (UNMT) that relies solely on mas...
research
03/19/2021

Congolese Swahili Machine Translation for Humanitarian Response

In this paper we describe our efforts to make a bidirectional Congolese ...
research
07/05/2016

Target-Side Context for Discriminative Models in Statistical Machine Translation

Discriminative translation models utilizing source context have been sho...
