Rule-based Morphological Inflection Improves Neural Terminology Translation

09/10/2021
by Weijia Xu, et al.

Current approaches to incorporating terminology constraints in machine translation (MT) typically assume that the constraint terms are provided in their correct morphological forms. This limits their application to real-world scenarios where constraint terms are provided as lemmas. In this paper, we introduce a modular framework for incorporating lemma constraints in neural MT (NMT) in which linguistic knowledge and diverse types of NMT models can be flexibly applied. It is based on a novel cross-lingual inflection module that inflects the target lemma constraints based on the source context. We explore linguistically motivated rule-based and data-driven neural-based inflection modules and design English-German health and English-Lithuanian news test suites to evaluate them in domain adaptation and low-resource MT settings. Results show that our rule-based inflection module helps NMT models incorporate lemma constraints more accurately than a neural module and outperforms the existing end-to-end approach with lower training costs.
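To make the modular idea concrete, here is a minimal Python sketch of the first stage only: inflecting a target lemma constraint from the source context before it is handed to a constrained NMT model. It is not the authors' implementation. The paradigm table `PARADIGMS`, the helpers `predict_features` and `inflect_constraints`, and the two toy rules (a trailing "s" signals plural; a preceding "with" signals dative) are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Optional, Tuple

Features = Tuple[str, str]  # (case, number)

# Hypothetical toy lexicon: one German noun with a few of its forms.
PARADIGMS: Dict[str, Dict[Features, str]] = {
    "Medikament": {
        ("nom", "sg"): "Medikament",
        ("dat", "sg"): "Medikament",
        ("nom", "pl"): "Medikamente",
        ("dat", "pl"): "Medikamenten",
    },
}


def predict_features(source_tokens: List[str], term_index: int) -> Features:
    """Guess target case and number from the source context (toy rules)."""
    token = source_tokens[term_index].lower()
    # Assumption: an English plural is marked by a trailing "s".
    number = "pl" if token.endswith("s") else "sg"
    prev = source_tokens[term_index - 1].lower() if term_index > 0 else ""
    # Assumption: "with" maps to a German preposition that takes the dative.
    case = "dat" if prev == "with" else "nom"
    return case, number


def find_term(source_tokens: List[str], src_term: str) -> Optional[int]:
    """Return the index of the source term (singular or plural), if present."""
    for i, tok in enumerate(source_tokens):
        if tok.lower() in (src_term, src_term + "s"):
            return i
    return None


def inflect_constraints(
    source_tokens: List[str], constraints: List[Tuple[str, str]]
) -> List[str]:
    """Map (source term, target lemma) pairs to inflected target forms."""
    inflected = []
    for src_term, tgt_lemma in constraints:
        idx = find_term(source_tokens, src_term)
        if idx is None:
            continue  # the constraint does not apply to this sentence
        feats = predict_features(source_tokens, idx)
        # Fall back to the bare lemma when no form is listed.
        inflected.append(PARADIGMS.get(tgt_lemma, {}).get(feats, tgt_lemma))
    return inflected


if __name__ == "__main__":
    source = "Treat it with medications".split()
    print(inflect_constraints(source, [("medication", "Medikament")]))
```

Because the inflection step only produces surface forms, its output could in principle be passed to any NMT model that accepts hard or soft terminology constraints, which is the flexibility the abstract describes.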

research
08/30/2018

Pronoun Translation in English-French Machine Translation: An Analysis of Error Types

Pronouns are a long-standing challenge in machine translation. We presen...
research
08/12/2017

Statistical Vs Rule Based Machine Translation; A Case Study on Indian Language Perspective

In this paper we present our work on a case study between Statistical Ma...
research
09/13/2022

Data-adaptive Transfer Learning for Translation: A Case Study in Haitian and Jamaican

Multilingual transfer techniques often improve low-resource machine tran...
research
06/22/2021

On the Evaluation of Machine Translation for Terminology Consistency

As neural machine translation (NMT) systems become an important part of ...
research
05/01/2020

Selecting Backtranslated Data from Multiple Sources for Improved Neural Machine Translation

Machine translation (MT) has benefited from using synthetic training dat...
research
10/19/2020

Incorporating Terminology Constraints in Automatic Post-Editing

Users of machine translation (MT) may want to ensure the use of specific...
research
05/08/2021

Falling Through the Gaps: Neural Architectures as Models of Morphological Rule Learning

Recent advances in neural architectures have revived the problem of morp...
