EdgeFormer: A Parameter-Efficient Transformer for On-Device Seq2seq Generation

02/16/2022
by   Tao Ge, et al.
0

We propose EdgeFormer – a parameter-efficient Transformer of the encoder-decoder architecture for on-device seq2seq generation, which is customized under the strict computation and memory constraints. EdgeFormer proposes two novel principles for cost-effective parameterization and further enhance the model with efficient layer adaptation. We conduct extensive experiments on two practical on-device seq2seq tasks: Machine Translation and Grammatical Error Correction, and show that EdgeFormer can effectively outperform previous parameter-efficient Transformer baselines and achieve very competitive results with knowledge distillation under both the computation and memory constraints.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset