Hidden Markov Transformer for Simultaneous Machine Translation

03/01/2023
by   Shaolei Zhang, et al.
0

Simultaneous machine translation (SiMT) outputs the target sequence while receiving the source sequence, and hence learning when to start translating each target token is the core challenge for SiMT task. However, it is non-trivial to learn the optimal moment among many possible moments of starting translating, as the moments of starting translating always hide inside the model and can only be supervised with the observed target sequence. In this paper, we propose a Hidden Markov Transformer (HMT), which treats the moments of starting translating as hidden events and the target sequence as the corresponding observed events, thereby organizing them as a hidden Markov model. HMT explicitly models multiple moments of starting translating as the candidate hidden events, and then selects one to generate the target token. During training, by maximizing the marginal likelihood of the target sequence over multiple moments of starting translating, HMT learns to start translating at the moments that target tokens can be generated more accurately. Experiments on multiple SiMT benchmarks show that HMT outperforms strong baselines and achieves state-of-the-art performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/18/2020

Efficient Wait-k Models for Simultaneous Machine Translation

Simultaneous machine translation consists in starting output generation ...
research
10/22/2022

Information-Transport-based Policy for Simultaneous Translation

Simultaneous translation (ST) outputs translation while receiving the so...
research
10/20/2022

Wait-info Policy: Balancing Source and Target at Information Level for Simultaneous Machine Translation

Simultaneous machine translation (SiMT) outputs the translation while re...
research
09/12/2023

Glancing Future for Simultaneous Machine Translation

Simultaneous machine translation (SiMT) outputs translation while readin...
research
11/27/2019

Simultaneous Neural Machine Translation using Connectionist Temporal Classification

Simultaneous machine translation is a variant of machine translation tha...
research
01/25/2006

Fast Lexically Constrained Viterbi Algorithm (FLCVA): Simultaneous Optimization of Speed and Memory

Lexical constraints on the input of speech and on-line handwriting syste...
research
11/24/2012

Shadows and headless shadows: a worlds-based, autobiographical approach to reasoning

Many cognitive systems deploy multiple, closed, individually consistent ...

Please sign up or login with your details

Forgot password? Click here to reset