Memory-Augmented Neural Networks for Machine Translation

09/18/2019
by   Mark Collier, et al.
18

Memory-augmented neural networks (MANNs) have been shown to outperform other recurrent neural network architectures on a series of artificial sequence learning tasks, yet they have had limited application to real-world tasks. We evaluate direct application of Neural Turing Machines (NTM) and Differentiable Neural Computers (DNC) to machine translation. We further propose and evaluate two models which extend the attentional encoder-decoder with capabilities inspired by memory augmented neural networks. We evaluate our proposed models on IWSLT Vietnamese to English and ACL Romanian to English datasets. Our proposed models and the memory augmented neural networks perform similarly to the attentional encoder-decoder on the Vietnamese to English translation task while have a 0.3-1.9 lower BLEU score for the Romanian to English task. Interestingly, our analysis shows that despite being equipped with additional flexibility and being randomly initialized memory augmented neural networks learn an algorithm for machine translation almost identical to the attentional encoder-decoder.

READ FULL TEXT

page 6

page 7

page 8

research
06/12/2017

Attention Is All You Need

The dominant sequence transduction models are based on complex recurrent...
research
10/27/2016

Scaling Memory-Augmented Neural Networks with Sparse Reads and Writes

Neural networks augmented with external memory have the ability to learn...
research
12/04/2017

An Encoder-Decoder Model for ICD-10 Coding of Death Certificates

Information extraction from textual documents such as hospital records a...
research
06/14/2021

English to Bangla Machine Translation Using Recurrent Neural Network

The applications of recurrent neural networks in machine translation are...
research
07/01/2019

Understanding Memory Modules on Learning Simple Algorithms

Recent work has shown that memory modules are crucial for the generaliza...
research
09/14/2020

Reservoir Memory Machines as Neural Computers

Differentiable neural computers extend artificial neural networks with a...
research
12/27/2017

CNN Is All You Need

The Convolution Neural Network (CNN) has demonstrated the unique advanta...

Please sign up or login with your details

Forgot password? Click here to reset