Dense Information Flow for Neural Machine Translation

06/03/2018
by   Yanyao Shen, et al.
0

Recently, neural machine translation has achieved remarkable progress by introducing well-designed deep neural networks into its encoder-decoder framework. From the optimization perspective, residual connections are adopted to improve learning performance for both encoder and decoder in most of these deep architectures, and advanced attention connections are applied as well. Inspired by the success of the DenseNet model in computer vision problems, in this paper, we propose a densely connected NMT architecture (DenseNMT) that is able to train more efficiently for NMT. The proposed DenseNMT not only allows dense connection in creating new features for both encoder and decoder, but also uses the dense attention structure to improve attention quality. Our experiments on multiple datasets show that DenseNMT structure is more competitive and efficient.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/13/2016

Implicit Distortion and Fertility Models for Attention-based Encoder-Decoder NMT Model

Neural machine translation has shown very promising results lately. Most...
research
04/28/2019

Neural Machine Translation with Recurrent Highway Networks

Recurrent Neural Networks have lately gained a lot of popularity in lang...
research
04/16/2019

SparseMask: Differentiable Connectivity Learning for Dense Image Prediction

In this paper, we aim at automatically searching an efficient network ar...
research
09/12/2017

Refining Source Representations with Relation Networks for Neural Machine Translation

Although neural machine translation (NMT) with the encoder-decoder frame...
research
01/22/2019

Understanding Geometry of Encoder-Decoder CNNs

Encoder-decoder networks using convolutional neural network (CNN) archit...
research
05/10/2022

Multifidelity data fusion in convolutional encoder/decoder networks

We analyze the regression accuracy of convolutional neural networks asse...
research
06/25/2022

Probing Causes of Hallucinations in Neural Machine Translations

Hallucination, one kind of pathological translations that bothers Neural...

Please sign up or login with your details

Forgot password? Click here to reset