Building a Neural Machine Translation System Using Only Synthetic Parallel Data

04/02/2017
by   Jaehong Park, et al.
0

Recent works have shown that synthetic parallel data automatically generated by translation models can be effective for various neural machine translation (NMT) issues. In this study, we build NMT systems using only synthetic parallel data. As an efficient alternative to real parallel data, we also present a new type of synthetic parallel corpus. The proposed pseudo parallel data are distinct from previous works in that ground truth and synthetic examples are mixed on both sides of sentence pairs. Experiments on Czech-German and French-German translations demonstrate the efficacy of the proposed pseudo parallel corpus, which shows not only enhanced results for bidirectional translation tasks but also substantial improvement with the aid of a ground truth real parallel corpus.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/22/2019

Corpus Augmentation by Sentence Segmentation for Low-Resource Neural Machine Translation

Neural Machine Translation (NMT) has been proven to achieve impressive r...
research
04/20/2023

Exploring Paracrawl for Document-level Neural Machine Translation

Document-level neural machine translation (NMT) has outperformed sentenc...
research
02/15/2021

Meta Back-translation

Back-translation is an effective strategy to improve the performance of ...
research
03/29/2018

Identifying Semantic Divergences in Parallel Text without Annotations

Recognizing that even correct translations are not always semantically e...
research
12/05/2020

Reciprocal Supervised Learning Improves Neural Machine Translation

Despite the recent success on image classification, self-training has on...
research
04/05/2020

AR: Auto-Repair the Synthetic Data for Neural Machine Translation

Compared with only using limited authentic parallel data as training cor...
research
09/19/2018

NICT's Corpus Filtering Systems for the WMT18 Parallel Corpus Filtering Task

This paper presents the NICT's participation in the WMT18 shared paralle...

Please sign up or login with your details

Forgot password? Click here to reset