DeepAI AI Chat
Log In Sign Up

Cognate-aware morphological segmentation for multilingual neural translation

by   Stig-Arne Grönroos, et al.

This article describes the Aalto University entry to the WMT18 News Translation Shared Task. We participate in the multilingual subtrack with a system trained under the constrained condition to translate from English to both Finnish and Estonian. The system is based on the Transformer model. We focus on improving the consistency of morphological segmentation for words that are similar orthographically, semantically, and distributionally; such words include etymological cognates, loan words, and proper names. For this, we introduce Cognate Morfessor, a multilingual variant of the Morfessor method. We show that our approach improves the translation quality particularly for Estonian, which has less resources for training the translation model.


page 1

page 2

page 3

page 4


The Ubiqus English-Inuktitut System for WMT20

This paper describes Ubiqus' submission to the WMT20 English-Inuktitut s...

UAlberta at SemEval 2022 Task 2: Leveraging Glosses and Translations for Multilingual Idiomaticity Detection

We describe the University of Alberta systems for the SemEval-2022 Task ...

The University of Helsinki submissions to the WMT19 news translation task

In this paper, we present the University of Helsinki submissions to the ...

Facebook AI WMT21 News Translation Task Submission

We describe Facebook's multilingual model submission to the WMT2021 shar...

CUNI System for the WMT19 Robustness Task

We present our submission to the WMT19 Robustness Task. Our baseline sys...

Morphological Segmentation Inside-Out

Morphological segmentation has traditionally been modeled with non-hiera...