Neural Machine Translation with Recurrent Highway Networks

04/28/2019
by Maulik Parmar, et al.

Recurrent Neural Networks have recently gained considerable popularity in language modelling tasks, especially in neural machine translation (NMT). Recent NMT models are based on the Encoder-Decoder architecture, in which a deep LSTM-based encoder projects the source sentence to a fixed-dimensional vector and another deep LSTM decodes the target sentence from that vector. However, there has been very little work exploring architectures that have more than one layer in space (i.e. within each time step). This paper examines the effectiveness of simple Recurrent Highway Networks (RHN) in NMT tasks. The model uses a Recurrent Highway Network in both the encoder and the decoder, with attention. We also explore the reconstructor model to improve adequacy. We demonstrate the effectiveness of all three approaches on the IWSLT English-Vietnamese dataset. We see that RHN performs on par with LSTM-based models, and even better in some cases. We also see that deep RHN models are easier to train than deep LSTM-based models because of their highway connections. The paper also investigates the effects of increasing the recurrent depth within each time step.
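To make "recurrent depth within each time step" concrete, here is a minimal sketch of a single RHN time step. It follows the commonly used coupled-gate formulation of Recurrent Highway Networks (carry gate = 1 - transform gate), which is an assumption here, since the abstract does not state the exact variant the paper uses. The names `init_params`, `rhn_step`, and `gate_bias`, the initialization scale, and the pure-Python list arithmetic are all illustrative choices, not taken from the paper.

```python
import math
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * v for w, v in zip(row, vector)) for row in matrix]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def init_params(input_dim, hidden_dim, depth, seed=0, gate_bias=-2.0):
    """Build one parameter dict per highway layer (hypothetical layout).

    A negative gate_bias starts the transform gate mostly closed, so the
    state is initially carried through the layers almost unchanged.
    """
    rng = random.Random(seed)

    def mat(rows, cols):
        return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]

    layers = []
    for layer in range(depth):
        p = {
            "R_h": mat(hidden_dim, hidden_dim),
            "R_t": mat(hidden_dim, hidden_dim),
            "b_h": [0.0] * hidden_dim,
            "b_t": [gate_bias] * hidden_dim,
        }
        if layer == 0:
            # Only the first highway layer sees the input of this time step.
            p["W_h"] = mat(hidden_dim, input_dim)
            p["W_t"] = mat(hidden_dim, input_dim)
        layers.append(p)
    return layers


def rhn_step(x, s, layers):
    """Apply len(layers) stacked highway layers within one time step."""
    for p in layers:
        h_pre = [a + b for a, b in zip(matvec(p["R_h"], s), p["b_h"])]
        t_pre = [a + b for a, b in zip(matvec(p["R_t"], s), p["b_t"])]
        if "W_h" in p:
            h_pre = [a + b for a, b in zip(h_pre, matvec(p["W_h"], x))]
            t_pre = [a + b for a, b in zip(t_pre, matvec(p["W_t"], x))]
        h = [math.tanh(z) for z in h_pre]   # candidate state
        t = [sigmoid(z) for z in t_pre]     # transform gate
        # Coupled carry gate: whatever is not transformed is carried over.
        s = [hi * ti + si * (1.0 - ti) for hi, ti, si in zip(h, t, s)]
    return s
```

Calling `rhn_step(x, s, init_params(4, 3, depth=3))` runs three highway layers for one input vector `x`, and increasing `depth` corresponds to increasing the recurrent depth. The carry path `si * (1.0 - ti)` is the highway connection: when the transform gate is near zero, the state passes through a layer nearly unchanged, which is the usual explanation for why deep variants remain trainable.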


research
06/03/2018

Dense Information Flow for Neural Machine Translation

Recently, neural machine translation has achieved remarkable progress by...
research
06/14/2021

English to Bangla Machine Translation Using Recurrent Neural Network

The applications of recurrent neural networks in machine translation are...
research
07/18/2019

Understanding Neural Machine Translation by Simplification: The Case of Encoder-free Models

In this paper, we try to understand neural machine translation (NMT) via...
research
05/02/2017

Deep Neural Machine Translation with Linear Associative Unit

Deep Neural Networks (DNNs) have provably enhanced the state-of-the-art ...
research
10/31/2018

You May Not Need Attention

In NMT, how far can we get without attention and without separate encodi...
research
05/21/2017

Spelling Correction as a Foreign Language

In this paper, we reformulated the spell correction problem as a machine...
research
09/06/2017

Information-Propogation-Enhanced Neural Machine Translation by Relation Model

Even though sequence-to-sequence neural machine translation (NMT) model ...
