Improving Polish to English Neural Machine Translation with Transfer Learning: Effects of Data Volume and Language Similarity

06/01/2023
by   Juuso Eronen, et al.
0

This paper investigates the impact of data volume and the use of similar languages on transfer learning in a machine translation task. We find out that having more data generally leads to better performance, as it allows the model to learn more patterns and generalizations from the data. However, related languages can also be particularly effective when there is limited data available for a specific language pair, as the model can leverage the similarities between the languages to improve performance. To demonstrate, we fine-tune mBART model for a Polish-English translation task using the OPUS-100 dataset. We evaluate the performance of the model under various transfer learning configurations, including different transfer source languages and different shot levels for Polish, and report the results. Our experiments show that a combination of related languages and larger amounts of data outperforms the model trained on related languages or larger amounts of data alone. Additionally, we show the importance of related languages in zero-shot and few-shot configurations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/01/2021

Low-Resource Neural Machine Translation for Southern African Languages

Low-resource African languages have not fully benefited from the progres...
research
03/09/2020

Tigrinya Neural Machine Translation with Transfer Learning for Humanitarian Response

We report our experiments in building a domain-specific Tigrinya-to-Engl...
research
08/07/2021

Improving Similar Language Translation With Transfer Learning

We investigate transfer learning based on pre-trained neural machine tra...
research
03/31/2021

Zero-Shot Language Transfer vs Iterative Back Translation for Unsupervised Machine Translation

This work focuses on comparing different solutions for machine translati...
research
05/22/2023

Decomposed Prompting for Machine Translation Between Related Languages using Large Language Models

This study investigates machine translation between related languages i....
research
03/21/2022

Transformer-based HTR for Historical Documents

We apply the TrOCR framework to real-world, historical manuscripts and s...
research
06/03/2020

Transfer Learning for British Sign Language Modelling

Automatic speech recognition and spoken dialogue systems have made great...

Please sign up or login with your details

Forgot password? Click here to reset