Domain Adaptation in Neural Machine Translation using a Qualia-Enriched FrameNet

02/21/2022
by   Alexandre Diniz Costa, et al.
0

In this paper we present Scylla, a methodology for domain adaptation of Neural Machine Translation (NMT) systems that make use of a multilingual FrameNet enriched with qualia relations as an external knowledge base. Domain adaptation techniques used in NMT usually require fine-tuning and in-domain training data, which may pose difficulties for those working with lesser-resourced languages and may also lead to performance decay of the NMT system for out-of-domain sentences. Scylla does not require fine-tuning of the NMT model, avoiding the risk of model over-fitting and consequent decrease in performance for out-of-domain translations. Two versions of Scylla are presented: one using the source sentence as input, and another one using the target sentence. We evaluate Scylla in comparison to a state-of-the-art commercial NMT system in an experiment in which 50 sentences from the Sports domain are translated from Brazilian Portuguese to English. The two versions of Scylla significantly outperform the baseline commercial system in HTER.

READ FULL TEXT

page 11

page 12

research
01/12/2017

An Empirical Comparison of Simple Domain Adaptation Methods for Neural Machine Translation

In this paper, we propose a novel domain adaptation method named "mixed ...
research
06/19/2019

Multilingual Multi-Domain Adaptation Approaches for Neural Machine Translation

In this paper, we propose two novel methods for domain adaptation for th...
research
10/30/2019

Ordering Matters: Word Ordering Aware Unsupervised NMT

Denoising-based Unsupervised Neural Machine Translation (U-NMT) models t...
research
09/16/2021

Translation Transformers Rediscover Inherent Data Domains

Many works proposed methods to improve the performance of Neural Machine...
research
09/07/2017

Translating Domain-Specific Expressions in Knowledge Bases with Neural Machine Translation

Our work presented in this paper focuses on the translation of domain-sp...
research
03/23/2021

Repairing Pronouns in Translation with BERT-Based Post-Editing

Pronouns are important determinants of a text's meaning but difficult to...
research
12/31/2020

FDMT: A Benchmark Dataset for Fine-grained Domain Adaptation in Machine Translation

Previous domain adaptation research usually neglect the diversity in tra...

Please sign up or login with your details

Forgot password? Click here to reset