Towards Bidirectional Hierarchical Representations for Attention-Based Neural Machine Translation

07/17/2017
by   Baosong Yang, et al.
0

This paper proposes a hierarchical attentional neural translation model which focuses on enhancing source-side hierarchical representations by covering both local and global semantic information using a bidirectional tree-based encoder. To maximize the predictive likelihood of target words, a weighted variant of an attention mechanism is used to balance the attentive information between lexical and phrase vectors. Using a tree-based rare word encoding, the proposed model is extended to sub-word level to alleviate the out-of-vocabulary (OOV) problem. Empirical results reveal that the proposed model significantly outperforms sequence-to-sequence attention-based and tree-based neural translation models in English-Chinese translation tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/15/2015

Agreement-based Joint Training for Bidirectional Attention-based Neural Machine Translation

The attentional mechanism has proven to be effective in improving end-to...
research
03/19/2016

Tree-to-Sequence Attentional Neural Machine Translation

Most of the existing Neural Machine Translation (NMT) models focus on th...
research
06/29/2018

Neural Machine Translation with Key-Value Memory-Augmented Attention

Although attention-based Neural Machine Translation (NMT) has achieved r...
research
11/25/2016

Neural Machine Translation with Latent Semantic of Image and Text

Although attention-based Neural Machine Translation have achieved great ...
research
08/22/2018

Learning When to Concentrate or Divert Attention: Self-Adaptive Attention Temperature for Neural Machine Translation

Most of the Neural Machine Translation (NMT) models are based on the seq...
research
11/29/2019

Neural Chinese Word Segmentation as Sequence to Sequence Translation

Recently, Chinese word segmentation (CWS) methods using neural networks ...
research
03/13/2020

Sentence Level Human Translation Quality Estimation with Attention-based Neural Networks

This paper explores the use of Deep Learning methods for automatic estim...

Please sign up or login with your details

Forgot password? Click here to reset