Compression of Neural Machine Translation Models via Pruning

06/29/2016
by   Abigail See, et al.
0

Neural Machine Translation (NMT), like many other deep learning domains, typically suffers from over-parameterization, resulting in large storage sizes. This paper examines three simple magnitude-based pruning schemes to compress NMT models, namely class-blind, class-uniform, and class-distribution, which differ in terms of how pruning thresholds are computed for the different classes of weights in the NMT architecture. We demonstrate the efficacy of weight pruning as a compression technique for a state-of-the-art NMT system. We show that an NMT model with over 200 million parameters can be pruned by 40 with very little performance loss as measured on the WMT'14 English-German translation task. This sheds light on the distribution of redundancy in the NMT architecture. Our main result is that with retraining, we can recover and even surpass the original performance with an 80

READ FULL TEXT
research
05/08/2020

Neural Machine Translation for South Africa's Official Languages

Recent advances in neural machine translation (NMT) have led to state-of...
research
04/05/2020

Neural Machine Translation with Imbalanced Classes

We cast neural machine translation (NMT) as a classification task in an ...
research
05/24/2019

An Analysis of Source-Side Grammatical Errors in NMT

The quality of Neural Machine Translation (NMT) has been shown to signif...
research
10/18/2019

A language processing algorithm for predicting tactical solutions to an operational planning problem under uncertainty

This paper is devoted to the prediction of solutions to a stochastic dis...
research
06/15/2016

The Edit Distance Transducer in Action: The University of Cambridge English-German System at WMT16

This paper presents the University of Cambridge submission to WMT16. Mot...
research
11/26/2017

Learning to Remember Translation History with a Continuous Cache

Existing neural machine translation (NMT) models generally translate sen...
research
09/13/2019

Neural Machine Translation with 4-Bit Precision and Beyond

Neural Machine Translation (NMT) is resource intensive. We design a quan...

Please sign up or login with your details

Forgot password? Click here to reset