Neural Machine Translation with Imbalanced Classes

04/05/2020
by Thamme Gowda, et al.

We cast neural machine translation (NMT) as a classification task in an autoregressive setting and analyze the limitations of both classification and autoregression components. Classifiers are known to perform better with balanced class distributions during training. Since the Zipfian nature of languages causes imbalanced classes, we explore the effect of class imbalance on NMT. We analyze the effect of vocabulary sizes on NMT performance and reveal an explanation for 'why' certain vocabulary sizes are better than others.

research
07/25/2017

Machine Translation at Booking.com: Journey and Lessons Learned

We describe our recently developed neural machine translation (NMT) syst...
research
06/29/2016

Compression of Neural Machine Translation Models via Pruning

Neural Machine Translation (NMT), like many other deep learning domains,...
research
06/27/2017

Memory-augmented Chinese-Uyghur Neural Machine Translation

Neural machine translation (NMT) has achieved notable performance recent...
research
09/18/2017

Toward a full-scale neural machine translation in production: the Booking.com use case

While some remarkable progress has been made in neural machine translati...
research
11/03/2018

Transfer Learning in Multilingual Neural Machine Translation with Dynamic Vocabulary

We propose a method to transfer knowledge across neural machine translat...
research
03/14/2021

Crowdsourced Phrase-Based Tokenization for Low-Resourced Neural Machine Translation: The Case of Fon Language

Building effective neural machine translation (NMT) models for very low-...
research
05/04/2020

Evaluating Explanation Methods for Neural Machine Translation

Recently many efforts have been devoted to interpreting the black-box NM...
