Re-Translation Strategies For Long Form, Simultaneous, Spoken Language Translation

12/06/2019
by   Naveen Arivazhagan, et al.
0

We investigate the problem of simultaneous machine translation of long-form speech content. We target a continuous speech-to-text scenario, generating translated captions for a live audio feed, such as a lecture or play-by-play commentary. As this scenario allows for revisions to our incremental translations, we adopt a re-translation approach to simultaneous translation, where the source is repeatedly translated from scratch as it grows. This approach naturally exhibits very low latency and high final quality, but at the cost of incremental instability as the output is continuously refined. We experiment with a pipeline of industry-grade speech recognition and translation tools, augmented with simple inference heuristics to improve stability. We use TED Talks as a source of multilingual test data, developing our techniques on English-to-German spoken language translation. Our minimalist approach to simultaneous translation allows us to easily scale our final evaluation to six more target languages, dramatically improving incremental stability for all of them.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/15/2021

Incremental Speech Synthesis For Speech-To-Speech Translation

In a speech-to-speech translation (S2ST) pipeline, the text-to-speech (T...
research
05/30/2020

Dynamic Masking for Improved Stability in Spoken Language Translation

For spoken language translation (SLT) in live scenarios such as conferen...
research
03/04/2022

Comprehension of Subtitles from Re-Translating Simultaneous Speech Translation

In simultaneous speech translation, one can vary the size of the output ...
research
05/26/2023

Robustness of Multi-Source MT to Transcription Errors

Automatic speech translation is sensitive to speech recognition errors, ...
research
10/20/2020

Fluent and Low-latency Simultaneous Speech-to-Speech Translation with Self-adaptive Training

Simultaneous speech-to-speech translation is widely useful but extremely...
research
03/28/2022

Multilingual Simultaneous Speech Translation

Applications designed for simultaneous speech translation during events ...
research
06/24/2021

On the Influence of Machine Translation on Language Origin Obfuscation

In the last decade, machine translation has become a popular means to de...

Please sign up or login with your details

Forgot password? Click here to reset