Neural Machine Translation with Imbalanced Classes

04/05/2020
by Thamme Gowda, et al.

We cast neural machine translation (NMT) as a classification task in an autoregressive setting and analyze the limitations of both the classification and the autoregression components. Classifiers are known to perform better when class distributions are balanced during training. Since the Zipfian nature of languages causes imbalanced classes, we explore the effect of class imbalance on NMT. We analyze the effect of vocabulary size on NMT performance and offer an explanation of why certain vocabulary sizes are better than others.
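
To make the notion of class imbalance concrete, the sketch below treats each vocabulary type as a class and measures how far the observed class distribution is from uniform. The toy corpus, the function names, and the choice of measure (half the L1 distance to the uniform distribution) are assumptions made for illustration and are not taken from the paper.

```python
from collections import Counter


def class_imbalance(tokens):
    """Half the L1 distance between the empirical class (token type)
    distribution and the uniform distribution over the observed types.
    0.0 means perfectly balanced; values closer to 1.0 mean more skew."""
    counts = Counter(tokens)
    total = sum(counts.values())
    num_classes = len(counts)
    return 0.5 * sum(abs(c / total - 1.0 / num_classes) for c in counts.values())


def zipfian_corpus(vocab_size, scale=1000):
    """Hypothetical toy corpus: the type of rank r occurs about scale / r
    times (at least once), which roughly follows Zipf's law."""
    corpus = []
    for rank in range(1, vocab_size + 1):
        corpus.extend([f"w{rank}"] * max(1, scale // rank))
    return corpus


# Four types, each seen 25 times: perfectly balanced classes.
balanced = class_imbalance(["a", "b", "c", "d"] * 25)

# Zipf-like corpora with a small and a large vocabulary.
skewed_small = class_imbalance(zipfian_corpus(10))
skewed_large = class_imbalance(zipfian_corpus(1000))

print(f"balanced:     {balanced:.3f}")
print(f"zipf, V=10:   {skewed_small:.3f}")
print(f"zipf, V=1000: {skewed_large:.3f}")
```

In this toy setting the larger vocabulary yields the more imbalanced class distribution, because it adds many rare types while a few frequent types still dominate. Real subword vocabularies behave differently in detail, so the numbers only illustrate the kind of quantity one might track when comparing vocabulary sizes.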
