Multilingual Speech-to-Speech Translation into Multiple Target Languages

07/17/2023
by   Hongyu Gong, et al.
0

Speech-to-speech translation (S2ST) enables spoken communication between people talking in different languages. Despite a few studies on multilingual S2ST, their focus is the multilinguality on the source side, i.e., the translation from multiple source languages to one target language. We present the first work on multilingual S2ST supporting multiple target languages. Leveraging recent advance in direct S2ST with speech-to-unit and vocoder, we equip these key components with multilingual capability. Speech-to-masked-unit (S2MU) is the multilingual extension of S2U, which applies masking to units which don't belong to the given target language to reduce the language interference. We also propose multilingual vocoder which is trained with language embedding and the auxiliary loss of language identification. On benchmark translation testsets, our proposed multilingual model shows superior performance than bilingual models in the translation from English into 16 target languages.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/20/2020

CoVoST 2 and Massively Multilingual Speech-to-Text Translation

Speech translation has recently become an increasingly popular topic of ...
research
09/28/2022

Multilingual Transitivity and Bidirectional Multilingual Agreement for Multilingual Document-level Machine Translation

Multilingual machine translation has been proven an effective strategy t...
research
06/29/2023

Learning Multilingual Expressive Speech Representation for Prosody Prediction without Parallel Data

We propose a method for speech-to-speech emotionpreserving translation t...
research
01/29/2020

Improving Language Identification for Multilingual Speakers

Spoken language identification (LID) technologies have improved in recen...
research
05/31/2023

Multilingual Multi-Figurative Language Detection

Figures of speech help people express abstract concepts and evoke strong...
research
06/02/2023

Multilingual Conceptual Coverage in Text-to-Image Models

We propose "Conceptual Coverage Across Languages" (CoCo-CroLa), a techni...
research
05/08/2020

Synchronous Bidirectional Learning for Multilingual Lip Reading

Lip reading has received increasing attention in recent years. This pape...

Please sign up or login with your details

Forgot password? Click here to reset