Simple and Effective Noisy Channel Modeling for Neural Machine Translation

08/15/2019
by   Kyra Yee, et al.
0

Previous work on neural noisy channel modeling relied on latent variable models that incrementally process the source and target sentence. This makes decoding decisions based on partial source prefixes even though the full source is available. We pursue an alternative approach based on standard sequence to sequence models which utilize the entire source. These models perform remarkably well as channel models, even though they have neither been trained on, nor designed to factor over incomplete target sentences. Experiments with neural language models trained on billions of words show that noisy channel models can outperform a direct model by up to 3.2 BLEU on WMT'17 German-English translation. We evaluate on four language-pairs and our channel models consistently outperform strong alternatives such right-to-left reranking models and ensembles of direct models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/08/2016

The Neural Noisy Channel

We formulate sequence to sequence transduction as a noisy channel decodi...
research
11/13/2020

Language Models not just for Pre-training: Fast Online Neural Noisy Channel Modeling

Pre-training models on vast quantities of unlabeled data has emerged as ...
research
06/13/2018

Generative Neural Machine Translation

We introduce Generative Neural Machine Translation (GNMT), a latent vari...
research
11/24/2020

Two-Way Neural Machine Translation: A Proof of Concept for Bidirectional Translation Modeling using a Two-Dimensional Grid

Neural translation models have proven to be effective in capturing suffi...
research
11/06/2022

Noisy Channel for Automatic Text Simplification

In this paper we present a simple re-ranking method for Automatic Senten...
research
01/04/2016

Mutual Information and Diverse Decoding Improve Neural Machine Translation

Sequence-to-sequence neural translation models learn semantic and syntac...
research
10/15/2015

Multilingual Image Description with Neural Sequence Models

In this paper we present an approach to multi-language image description...

Please sign up or login with your details

Forgot password? Click here to reset