The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation

04/26/2018
by   Mia Xu Chen, et al.
0

The past year has witnessed rapid advances in sequence-to-sequence (seq2seq) modeling for Machine Translation (MT). The classic RNN-based approaches to MT were first out-performed by the convolutional seq2seq model, which was then out-performed by the more recent Transformer model. Each of these new approaches consists of a fundamental architecture accompanied by a set of modeling and training techniques that are in principle applicable to other seq2seq architectures. In this paper, we tease apart the new architectures and their accompanying techniques in two ways. First, we identify several key modeling and training techniques, and apply them to the RNN architecture, yielding a new RNMT+ model that outperforms all of the three fundamental architectures on the benchmark WMT'14 English to French and English to German tasks. Second, we analyze the properties of each fundamental seq2seq architecture and devise new hybrid architectures intended to combine their strengths. Our hybrid models obtain further improvements, outperforming the RNMT+ model on both benchmark datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/19/2019

CVIT-MT Systems for WAT-2018

This document describes the machine translation system used in the submi...
research
05/04/2018

Upping the Ante: Towards a Better Benchmark for Chinese-to-English Machine Translation

There are many machine translation (MT) papers that propose novel approa...
research
12/19/2018

DTMT: A Novel Deep Transition Architecture for Neural Machine Translation

Past years have witnessed rapid developments in Neural Machine Translati...
research
06/01/2020

Online Versus Offline NMT Quality: An In-depth Analysis on English-German and German-English

We conduct in this work an evaluation study comparing offline and online...
research
09/02/2018

Towards Automated Customer Support

Recent years have seen growing interest in conversational agents, such a...
research
08/02/2017

The University of Edinburgh's Neural MT Systems for WMT17

This paper describes the University of Edinburgh's submissions to the WM...
research
08/02/2023

Empirical Translation Process Research: Past and Possible Future Perspectives

Over the past four decades, efforts have been made to develop and evalua...

Please sign up or login with your details

Forgot password? Click here to reset