CUNI Systems for the Unsupervised News Translation Task in WMT 2019

07/29/2019
by   Ivana Kvapilíková, et al.

In this paper we describe the CUNI translation system used for the unsupervised news shared task of the ACL 2019 Fourth Conference on Machine Translation (WMT19). We follow the strategy of Artetxe et al. (2018b): we first build a seed phrase-based system whose phrase table is initialized from cross-lingual embedding mappings trained on monolingual data, and then train a neural machine translation system on synthetic parallel data. The synthetic corpus was produced from a monolingual corpus by a tuned PBMT model refined through iterative back-translation. We further focus on the handling of named entities, the part of the vocabulary where the cross-lingual embedding mapping suffers most. Our system reaches a BLEU score of 15.3 on the German-Czech WMT19 shared task.
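To make the phrase-table initialization step more concrete, the following is a minimal sketch and not the authors' implementation. It assumes that source and target word embeddings have already been mapped into a shared space, and it scores candidate translations with a temperature-scaled softmax over cosine similarities, which is one common way to turn embedding similarity into translation probabilities. The toy vectors, the function names (`cosine`, `translation_probs`, `build_phrase_table`), and the values of `temperature` and `top_k` are all hypothetical and chosen only for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def translation_probs(src_vec, tgt_embeddings, temperature=0.1):
    """Softmax over cosine similarities to every target word.

    The temperature is an assumed hyperparameter; smaller values
    concentrate the probability mass on the nearest neighbours.
    """
    scores = {w: cosine(src_vec, v) / temperature
              for w, v in tgt_embeddings.items()}
    max_score = max(scores.values())  # subtract the max for numerical stability
    exps = {w: math.exp(s - max_score) for w, s in scores.items()}
    total = sum(exps.values())
    return {w: e / total for w, e in exps.items()}


def build_phrase_table(src_embeddings, tgt_embeddings, temperature=0.1, top_k=2):
    """Keep the top_k most probable target candidates for each source entry."""
    table = {}
    for src_word, src_vec in src_embeddings.items():
        probs = translation_probs(src_vec, tgt_embeddings, temperature)
        ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)
        table[src_word] = ranked[:top_k]
    return table


# Made-up 3-dimensional vectors standing in for German and Czech embeddings
# that are assumed to live in one shared cross-lingual space.
DE = {"hund": [0.9, 0.1, 0.0], "katze": [0.1, 0.9, 0.0], "haus": [0.0, 0.1, 0.9]}
CS = {"pes": [0.8, 0.2, 0.1], "kočka": [0.2, 0.8, 0.1], "dům": [0.1, 0.0, 0.9]}

phrase_table = build_phrase_table(DE, CS)
for src_word, candidates in phrase_table.items():
    print(src_word, candidates)
```

In a sketch like this, a named entity that is rare in the monolingual data would typically receive an unreliable vector and therefore unreliable candidates, which illustrates why such entries may need separate handling.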


