Latent Part-of-Speech Sequences for Neural Machine Translation

08/30/2019
by   Xuewen Yang, et al.
0

Learning target side syntactic structure has been shown to improve Neural Machine Translation (NMT). However, incorporating syntax through latent variables introduces additional complexity in inference, as the models need to marginalize over the latent syntactic structures. To avoid this, models often resort to greedy search which only allows them to explore a limited portion of the latent space. In this work, we introduce a new latent variable model, LaSyn, that captures the co-dependence between syntax and semantics, while allowing for effective and efficient inference over the latent space. LaSyn decouples direct dependence between successive latent variables, which allows its decoder to exhaustively search through the latent syntactic choices, while keeping decoding speed proportional to the size of the latent variable vocabulary. We implement LaSyn by modifying a transformer-based NMT system and design a neural expectation maximization algorithm that we regularize with part-of-speech information as the latent sequences. Evaluations on four different MT tasks show that incorporating target side syntax with LaSyn improves both translation quality, and also provides an opportunity to improve diversity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/28/2020

Variational Neural Machine Translation with Normalizing Flows

Variational Neural Machine Translation (VNMT) is an attractive framework...
research
12/11/2018

Conditional Variational Autoencoder for Neural Machine Translation

We explore the performance of latent variable models for conditional tex...
research
05/02/2023

Shared Latent Space by Both Languages in Non-Autoregressive Neural Machine Translation

Latent variable modeling in non-autoregressive neural machine translatio...
research
05/28/2018

Inducing Grammars with and for Neural Machine Translation

Machine translation systems require semantic knowledge and grammatical u...
research
01/16/2018

Variational Recurrent Neural Machine Translation

Partially inspired by successful applications of variational recurrent n...
research
08/28/2018

A Tree-based Decoder for Neural Machine Translation

Recent advances in Neural Machine Translation (NMT) show that adding syn...
research
05/01/2018

Multi-representation Ensembles and Delayed SGD Updates Improve Syntax-based NMT

We explore strategies for incorporating target syntax into Neural Machin...

Please sign up or login with your details

Forgot password? Click here to reset