Lingua Custodia's participation at the WMT 2021 Machine Translation using Terminologies shared task

11/03/2021
by Melissa Ailem, et al.

This paper describes Lingua Custodia's submission to the WMT21 shared task on machine translation using terminologies. We consider three translation directions: English to French, English to Russian, and English to Chinese. We use a Transformer-based architecture as a building block and explore a method that makes two main changes to the standard training procedure in order to handle terminologies. The first change augments the training data so as to encourage the model to learn a copy behavior when it encounters terminology constraint terms. The second change is constraint token masking, whose purpose is to ease the learning of copy behavior and to improve model generalization. Empirical results show that our method satisfies most terminology constraints while maintaining high translation quality.
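
The abstract does not spell out how the augmented inputs are built, so the following is only a minimal sketch of one plausible reading: the target term of each constraint is inlined after the matching source term (so the model can learn to copy it), and the source term tokens are optionally replaced by a mask token. The function name `annotate_source`, the tag tokens `<c>` and `</c>`, and the `<mask>` token are all hypothetical and are not taken from the paper.

```python
from typing import List, Tuple

# Hypothetical special tokens (assumptions; the abstract does not name them).
MASK = "<mask>"
TERM_START = "<c>"
TERM_END = "</c>"

Constraint = Tuple[List[str], List[str]]  # (source term tokens, target term tokens)


def annotate_source(
    tokens: List[str],
    constraints: List[Constraint],
    mask_source_terms: bool = True,
) -> List[str]:
    """Inline target terms after matching source terms.

    When mask_source_terms is True, each token of the matched source term
    is replaced by MASK, so the only lexical cue left is the inlined target
    term that the model is expected to copy.
    """
    out: List[str] = []
    i = 0
    while i < len(tokens):
        matched = False
        for src_term, tgt_term in constraints:
            n = len(src_term)
            # Exact token-level match of the source term at position i.
            if n > 0 and tokens[i:i + n] == src_term:
                out.extend([MASK] * n if mask_source_terms else src_term)
                out.append(TERM_START)
                out.extend(tgt_term)
                out.append(TERM_END)
                i += n
                matched = True
                break
        if not matched:
            out.append(tokens[i])
            i += 1
    return out


if __name__ == "__main__":
    source = "the central bank raised rates".split()
    terms = [(["central", "bank"], ["banque", "centrale"])]
    print(" ".join(annotate_source(source, terms)))
    print(" ".join(annotate_source(source, terms, mask_source_terms=False)))
```

In this sketch the annotated source would be paired with the unchanged reference translation during training. Exact token matching is a simplification: a real pipeline would also need to handle inflected forms and subword segmentation.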

research
06/07/2021

Encouraging Neural Machine Translation to Satisfy Terminology Constraints

We present a new approach to encourage neural machine translation to sat...
research
09/22/2021

The NiuTrans Machine Translation Systems for WMT21

This paper describes NiuTrans neural machine translation systems of the ...
research
07/27/2017

A Shared Task on Bandit Learning for Machine Translation

We introduce and describe the results of a novel shared task on bandit l...
research
11/30/2022

Findings of the WMT 2022 Shared Task on Translation Suggestion

We report the result of the first edition of the WMT shared task on Tran...
research
10/28/2020

The Volctrans Machine Translation System for WMT20

This paper describes our VolcTrans system on WMT20 shared news translati...
research
09/20/2021

CUNI systems for WMT21: Terminology translation Shared Task

This paper describes Charles University submission for Terminology trans...
research
05/30/2018

Marian: Cost-effective High-Quality Neural Machine Translation in C++

This paper describes the submissions of the "Marian" team to the WNMT 20...
