Exploiting Linguistic Resources for Neural Machine Translation Using Multi-task Learning

08/03/2017
by   Jan Niehues, et al.
0

Linguistic resources such as part-of-speech (POS) tags have been extensively used in statistical machine translation (SMT) frameworks and have yielded better performances. However, usage of such linguistic annotations in neural machine translation (NMT) systems has been left under-explored. In this work, we show that multi-task learning is a successful and a easy approach to introduce an additional knowledge into an end-to-end neural attentional model. By jointly training several natural language processing (NLP) tasks in one system, we are able to leverage common information and improve the performance of the individual task. We analyze the impact of three design decisions in multi-task learning: the tasks used in training, the training schedule, and the degree of parameter sharing across the tasks, which is defined by the network architecture. The experiments are conducted for an German to English translation task. As additional linguistic resources, we exploit POS information and named-entities (NE). Experiments show that the translation quality can be improved by up to 1.5 BLEU points under the low-resource condition. The performance of the POS tagger is also improved using the multi-task learning scheme.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/11/2018

Neural Machine Translation for Bilingually Scarce Scenarios: A Deep Multi-task Learning Approach

Neural machine translation requires large amounts of parallel training t...
research
06/12/2018

Multi-Task Neural Models for Translating Between Styles Within and Across Languages

Generating natural language requires conveying content in an appropriate...
research
08/01/2018

Low-Latency Neural Speech Translation

Through the development of neural machine translation, the quality of ma...
research
10/16/2020

Training Flexible Depth Model by Multi-Task Learning for Neural Machine Translation

The standard neural machine translation model can only decode with the s...
research
10/02/2022

The boundaries of meaning: a case study in neural machine translation

The success of deep learning in natural language processing raises intri...
research
03/29/2022

Visualizing the Relationship Between Encoded Linguistic Information and Task Performance

Probing is popular to analyze whether linguistic information can be capt...
research
10/04/2018

AutoLoss: Learning Discrete Schedules for Alternate Optimization

Many machine learning problems involve iteratively and alternately optim...

Please sign up or login with your details

Forgot password? Click here to reset