Ensembling of Distilled Models from Multi-task Teachers for Constrained Resource Language Pairs

11/26/2021
by   Amr Hendy, et al.
0

This paper describes our submission to the constrained track of WMT21 shared news translation task. We focus on the three relatively low resource language pairs Bengali to and from Hindi, English to and from Hausa, and Xhosa to and from Zulu. To overcome the limitation of relatively low parallel data we train a multilingual model using a multitask objective employing both parallel and monolingual data. In addition, we augment the data using back translation. We also train a bilingual model incorporating back translation and knowledge distillation then combine the two models using sequence-to-sequence mapping. We see around 70 around 25 to and from Zulu compared to bilingual baselines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/20/2021

CUNI systems for WMT21: Multilingual Low-Resource Translation for Indo-European Languages Shared Task

This paper describes Charles University submission for Multilingual Low-...
research
11/18/2020

The Ubiqus English-Inuktitut System for WMT20

This paper describes Ubiqus' submission to the WMT20 English-Inuktitut s...
research
11/16/2020

Facebook AI's WMT20 News Translation Task Submission

This paper describes Facebook AI's submission to WMT20 shared news trans...
research
04/08/2020

Transfer learning and subword sampling for asymmetric-resource one-to-many neural translation

There are several approaches for improving neural machine translation fo...
research
03/02/2023

Letz Translate: Low-Resource Machine Translation for Luxembourgish

Natural language processing of Low-Resource Languages (LRL) is often cha...
research
08/27/2021

From Pivots to Graphs: Augmented CycleDensity as a Generalization to One Time InverseConsultation

This paper describes an approach used to generate new translations using...
research
10/19/2022

A baseline revisited: Pushing the limits of multi-segment models for context-aware translation

This paper addresses the task of contextual translation using multi-segm...

Please sign up or login with your details

Forgot password? Click here to reset