Dynamic Terminology Integration for COVID-19 and other Emerging Domains

09/10/2021
by Toms Bergmanis, et al.

The majority of language domains require prudent use of terminology to ensure clarity and adequacy of the information conveyed. While the correct use of terminology for some languages and domains can be achieved by adapting general-purpose MT systems on large volumes of in-domain parallel data, such quantities of domain-specific data are seldom available for less-resourced languages and niche domains. Furthermore, as recently exemplified by COVID-19, no domain-specific parallel data is readily available for emerging domains. However, the gravity of this recent calamity created a high demand for reliable translation of critical information regarding the pandemic and infection prevention. This work is part of the WMT2021 Shared Task: Machine Translation using Terminologies, in which we describe Tilde MT systems that are capable of dynamic terminology integration at the time of translation. Our systems achieve up to 94% COVID-19 term use accuracy on the test set of the EN-FR language pair without having access to any form of in-domain information during system training. We conclude our work with a broader discussion of the Shared Task itself and of terminology translation in MT.

