C-NMT: A Collaborative Inference Framework for Neural Machine Translation

04/08/2022
by   Yukai Chen, et al.
0

Collaborative Inference (CI) optimizes the latency and energy consumption of deep learning inference through the inter-operation of edge and cloud devices. Albeit beneficial for other tasks, CI has never been applied to the sequence- to-sequence mapping problem at the heart of Neural Machine Translation (NMT). In this work, we address the specific issues of collaborative NMT, such as estimating the latency required to generate the (unknown) output sequence, and show how existing CI methods can be adapted to these applications. Our experiments show that CI can reduce the latency of NMT by up to 44 a non-collaborative approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/19/2019

Semantic Neural Machine Translation using AMR

It is intuitive that semantic representations can be useful for machine ...
research
03/19/2016

Tree-to-Sequence Attentional Neural Machine Translation

Most of the existing Neural Machine Translation (NMT) models focus on th...
research
10/07/2022

NMTSloth: Understanding and Testing Efficiency Degradation of Neural Machine Translation Systems

Neural Machine Translation (NMT) systems have received much recent atten...
research
02/27/2020

Echo State Neural Machine Translation

We present neural machine translation (NMT) models inspired by echo stat...
research
04/02/2021

Attention Forcing for Machine Translation

Auto-regressive sequence-to-sequence models with attention mechanisms ha...
research
05/10/2018

First Experiments with Neural Translation of Informal to Formal Mathematics

We report on our first experiments to train deep neural networks that au...
research
10/09/2020

Uncertainty-Aware Semantic Augmentation for Neural Machine Translation

As a sequence-to-sequence generation task, neural machine translation (N...

Please sign up or login with your details

Forgot password? Click here to reset