Effective General-Domain Data Inclusion for the Machine Translation Task by Vanilla Transformers

09/28/2022
by   Hassan Soliman, et al.

One of the vital breakthroughs in the history of machine translation is the development of the Transformer model. Not only is it revolutionary for various translation tasks, but also for a majority of other NLP tasks. In this paper, we aim at a Transformer-based system that translates a source sentence in German into its counterpart target sentence in English. We perform the experiments on the news commentary German-English parallel sentences from the WMT'13 dataset. In addition, we investigate whether including additional general-domain training data from the IWSLT'16 dataset improves the Transformer model's performance. We find that including the IWSLT'16 dataset in training yields a gain of 2 BLEU points on the test set of the WMT'13 dataset. A qualitative analysis examines how the use of general-domain data helps improve the quality of the produced translations.
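As a rough illustration of what including general-domain data in training can look like at the data level, the sketch below concatenates an in-domain parallel corpus with a general-domain one and shuffles the result reproducibly. This is a minimal sketch under stated assumptions, not the authors' pipeline: the function name `mix_corpora`, the seed, and the toy sentence pairs are all hypothetical, and the abstract does not say how the two datasets were actually combined.

```python
import random
from typing import List, Sequence, Tuple

# A parallel sentence pair: (German source, English target).
SentencePair = Tuple[str, str]


def mix_corpora(
    in_domain: Sequence[SentencePair],
    general_domain: Sequence[SentencePair],
    seed: int = 0,
) -> List[SentencePair]:
    """Concatenate two parallel corpora and shuffle them reproducibly.

    Hypothetical helper: simple concatenation is an assumption here,
    not a method confirmed by the abstract.
    """
    combined = list(in_domain) + list(general_domain)
    # A dedicated Random instance keeps the shuffle independent of global state.
    random.Random(seed).shuffle(combined)
    return combined


# Toy stand-ins for news-commentary (in-domain) and general-domain pairs.
news_pairs = [
    ("Die Wirtschaft wächst.", "The economy is growing."),
    ("Die Regierung plant Reformen.", "The government is planning reforms."),
]
talk_pairs = [
    ("Vielen Dank.", "Thank you very much."),
    ("Ich möchte eine Geschichte erzählen.", "I would like to tell a story."),
    ("Das ist eine gute Frage.", "That is a good question."),
]

train_pairs = mix_corpora(news_pairs, talk_pairs, seed=13)
print(len(train_pairs), "training pairs after mixing")
```

Shuffling after concatenation keeps each training batch from being drawn from a single domain; other mixing strategies, such as upsampling the in-domain data, would be equally plausible choices.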


