Sequence to Sequence Mixture Model for Diverse Machine Translation

10/17/2018
by   Xuanli He, et al.
0

Sequence to sequence (SEQ2SEQ) models often lack diversity in their generated translations. This can be attributed to the limitation of SEQ2SEQ models in capturing lexical and syntactic variations in a parallel corpus resulting from different styles, genres, topics, or ambiguity of the translation process. In this paper, we develop a novel sequence to sequence mixture (S2SMIX) model that improves both translation diversity and quality by adopting a committee of specialized translation models rather than a single translation model. Each mixture component selects its own training dataset via optimization of the marginal loglikelihood, which leads to a soft clustering of the parallel corpus. Experiments on four language pairs demonstrate the superiority of our mixture model compared to a SEQ2SEQ baseline with standard or diversity-boosted beam search. Our mixture model uses negligible additional parameters and incurs no extra computation cost during decoding.

READ FULL TEXT
research
09/08/2021

Mixup Decoding for Diverse Machine Translation

Diverse machine translation aims at generating various target language t...
research
02/20/2019

Mixture Models for Diverse Machine Translation: Tricks of the Trade

Mixture models trained via EM are among the simplest, most widely used a...
research
11/19/2017

Incorporating Syntactic Uncertainty in Neural Machine Translation with Forest-to-Sequence Model

Incorporating syntactic information in Neural Machine Translation models...
research
02/14/2022

Sequence-to-Sequence Resources for Catalan

In this work, we introduce sequence-to-sequence language resources for C...
research
12/11/2019

Unsupervised Neural Dialect Translation with Commonality and Diversity Modeling

As a special machine translation task, dialect translation has two main ...
research
02/28/2018

Analyzing Uncertainty in Neural Machine Translation

Machine translation is a popular test bed for research in neural sequenc...
research
06/07/2021

Lexicon Learning for Few-Shot Neural Sequence Modeling

Sequence-to-sequence transduction is the core problem in language proces...

Please sign up or login with your details

Forgot password? Click here to reset