Ensemble-based Transfer Learning for Low-resource Machine Translation Quality Estimation

05/17/2021
by   Ting-Wei Wu, et al.
0

Quality Estimation (QE) of Machine Translation (MT) is a task to estimate the quality scores for given translation outputs from an unknown MT system. However, QE scores for low-resource languages are usually intractable and hard to collect. In this paper, we focus on the Sentence-Level QE Shared Task of the Fifth Conference on Machine Translation (WMT20), but in a more challenging setting. We aim to predict QE scores of given translation outputs when barely none of QE scores of that paired languages are given during training. We propose an ensemble-based predictor-estimator QE model with transfer learning to overcome such QE data scarcity challenge by leveraging QE scores from other miscellaneous languages and translation results of targeted languages. Based on the evaluation results, we provide a detailed analysis of how each of our extension affects QE models on the reliability and the generalization ability to perform transfer learning under multilingual tasks. Finally, we achieve the best performance on the ensemble model combining the models pretrained by individual languages as well as different levels of parallel trained corpus with a Pearson's correlation of 0.298, which is 2.54 times higher than baselines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/14/2022

Improving Neural Machine Translation of Indigenous Languages with Multilingual Transfer Learning

Machine translation (MT) involving Indigenous languages, including those...
research
05/27/2023

Parallel Corpus for Indigenous Language Translation: Spanish-Mazatec and Spanish-Mixtec

In this paper, we present a parallel Spanish-Mazatec and Spanish-Mixtec ...
research
05/18/2022

PreQuEL: Quality Estimation of Machine Translation Outputs in Advance

We present the task of PreQuEL, Pre-(Quality-Estimation) Learning. A Pre...
research
06/07/2020

Growing Together: Modeling Human Language Learning With n-Best Multi-Checkpoint Machine Translation

We describe our submission to the 2020 Duolingo Shared Task on Simultane...
research
07/24/2021

MDQE: A More Accurate Direct Pretraining for Machine Translation Quality Estimation

It is expensive to evaluate the results of Machine Translation(MT), whic...
research
09/30/2020

On Romanization for Model Transfer Between Scripts in Neural Machine Translation

Transfer learning is a popular strategy to improve the quality of low-re...
research
03/24/2023

Towards Making the Most of ChatGPT for Machine Translation

ChatGPT shows remarkable capabilities for machine translation (MT). Seve...

Please sign up or login with your details

Forgot password? Click here to reset