Alibaba-Translate China's Submission for WMT 2022 Quality Estimation Shared Task

10/18/2022
by Keqin Bao, et al.

In this paper, we present our submission to the sentence-level MQM benchmark of the WMT 2022 Quality Estimation Shared Task. Our systems employ the UniTE (Unified Translation Evaluation) framework, which combines three types of input formats during training with a pre-trained language model. First, we apply pseudo-labeled data examples in the continued pre-training phase. Notably, to reduce the gap between pre-training and fine-tuning, we use data pruning and a ranking-based score normalization strategy. For the fine-tuning phase, we use both Direct Assessment (DA) and Multidimensional Quality Metrics (MQM) data from past years' WMT competitions. Finally, we collect the source-only evaluation results and ensemble the predictions generated by two UniTE models, whose backbones are XLM-R and InfoXLM, respectively. Results show that our models rank 1st overall in the Multilingual and English-Russian settings and 2nd overall in the English-German and Chinese-English settings, showing relatively strong performance in this year's quality estimation competition.
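
The abstract names a ranking-based score normalization and a two-model ensemble without giving their exact form. The following is a minimal sketch of one plausible reading, not the authors' implementation: it assumes that normalization means replacing each raw pseudo-label score with its tie-averaged rank scaled to [0, 1], and that ensembling means an unweighted mean of per-segment predictions. The function names `rank_normalize` and `ensemble_mean` are hypothetical.

```python
from typing import List, Sequence


def rank_normalize(scores: Sequence[float]) -> List[float]:
    """Map raw scores to tie-averaged ranks scaled into [0, 1].

    Assumption: this is only one possible form of rank-based
    normalization; the abstract does not specify the exact scheme.
    """
    n = len(scores)
    if n == 0:
        return []
    if n == 1:
        return [0.5]
    # Indices of the scores in ascending order of score.
    order = sorted(range(n), key=lambda i: scores[i])
    ranks = [0.0] * n
    i = 0
    while i < n:
        # Extend j over the run of items tied with position i.
        j = i
        while j + 1 < n and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg_rank = (i + j) / 2.0
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    return [r / (n - 1) for r in ranks]


def ensemble_mean(*model_scores: Sequence[float]) -> List[float]:
    """Average per-segment predictions from several models.

    Assumption: an unweighted mean; the actual ensembling strategy
    may differ.
    """
    if len({len(s) for s in model_scores}) != 1:
        raise ValueError("all models must score the same number of segments")
    return [sum(vals) / len(vals) for vals in zip(*model_scores)]
```

For instance, `rank_normalize([0.2, 0.9, 0.5])` keeps only the ordering of the three scores, and `ensemble_mean(preds_a, preds_b)` would combine the outputs of two models, such as the XLM-R and InfoXLM backbones, segment by segment.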


