Towards cross-language prosody transfer for dialog

07/09/2023
by Jonathan E. Avila, et al.

Speech-to-speech translation systems today do not adequately support dialog use. In particular, nuances of speaker intent and stance can be lost due to improper prosody transfer. We present an exploration of what needs to be done to overcome this. First, we developed a data collection protocol in which bilingual speakers re-enact utterances from an earlier conversation in their other language, and used it to collect an English-Spanish corpus, so far comprising 1871 matched utterance pairs. Second, we developed a simple prosodic dissimilarity metric based on Euclidean distance over a broad set of prosodic features. We then used the corpus and the metric to investigate cross-language prosodic differences, measure the likely utility of three simple baseline models, and identify phenomena that will require more powerful modeling. Our findings should inform future research on cross-language prosody and the design of speech-to-speech translation systems capable of effective prosody transfer.
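
As a rough illustration of a Euclidean-distance dissimilarity over prosodic feature vectors, the sketch below may help. It is not the authors' implementation: the function names, the toy feature values, and the per-feature z-normalization step are all assumptions made for the example, and the abstract does not specify which features are used or how they are scaled.

```python
import math
from statistics import mean, pstdev


def zscore_columns(rows):
    """Z-normalize each feature (column) across all utterances.

    Assumption: normalization keeps features with large numeric ranges
    from dominating the distance. The abstract does not state this step.
    """
    columns = list(zip(*rows))
    means = [mean(col) for col in columns]
    # Guard against constant features, whose standard deviation is zero.
    stds = [pstdev(col) or 1.0 for col in columns]
    return [
        [(value - m) / s for value, m, s in zip(row, means, stds)]
        for row in rows
    ]


def prosodic_dissimilarity(features_a, features_b):
    """Euclidean distance between two equal-length feature vectors."""
    if len(features_a) != len(features_b):
        raise ValueError("feature vectors must have the same length")
    return math.dist(features_a, features_b)


# Hypothetical two-feature vectors for three utterances (toy values only).
utterances = [[1.0, 2.0], [3.0, 2.0], [5.0, 2.0]]
normalized = zscore_columns(utterances)
distance = prosodic_dissimilarity(normalized[0], normalized[2])
```

With a metric of this shape, a smaller distance between an utterance and its re-enacted counterpart would indicate more similar prosody under the chosen features.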

research
11/18/2022

Dialogs Re-enacted Across Languages

To support machine learning of cross-language prosodic mappings and othe...
research
12/06/2016

Listen and Translate: A Proof of Concept for End-to-End Speech-to-Text Translation

This paper proposes a first attempt to build an end-to-end speech-to-tex...
research
05/11/2018

Bootstrapping Multilingual Intent Models via Machine Translation for Dialog Automation

With the resurgence of chat-based dialog systems in consumer and enterpr...
research
08/10/2017

Towards Neural Speaker Modeling in Multi-Party Conversation: The Task, Dataset, and Models

Neural network-based dialog systems are attracting increasing attention ...
research
11/11/2022

Speech-to-Speech Translation For A Real-world Unwritten Language

We study speech-to-speech translation (S2ST) that translates speech from...
research
06/12/2021

Don't Rule Out Monolingual Speakers: A Method For Crowdsourcing Machine Translation Data

High-performing machine translation (MT) systems can help overcome langu...
research
05/31/2023

SLABERT Talk Pretty One Day: Modeling Second Language Acquisition with BERT

Second language acquisition (SLA) research has extensively studied cross...
