Machine Translation in Pronunciation Space

11/03/2019
by   Hairong Liu, et al.

Research in the machine translation community focuses on translation in text space. However, humans are in fact also good at direct translation in pronunciation space. Some existing translation systems, such as simultaneous machine translation, are inherently more natural, and thus potentially more robust, when translating directly in pronunciation space. In this paper, we conduct large-scale experiments on a self-built dataset of about 20M En-Zh pairs of text sentences and their corresponding pronunciation sentences. We propose three new categories of translation: 1) translating a pronunciation sentence in the source language into a pronunciation sentence in the target language (P2P-Tran); 2) translating a text sentence in the source language into a pronunciation sentence in the target language (T2P-Tran); and 3) translating a pronunciation sentence in the source language into a text sentence in the target language (P2T-Tran). We compare them with traditional text translation (T2T-Tran). Our experiments clearly show that all four categories of translation achieve comparable performance, with small and sometimes negligible differences.


research
10/18/2021

Monotonic Simultaneous Translation with Chunk-wise Reordering and Refinement

Recent work in simultaneous machine translation is often trained with co...
research
08/05/2016

Winograd Schemas and Machine Translation

A Winograd schema is a pair of sentences that differ in a single word an...
research
09/08/2021

Mixup Decoding for Diverse Machine Translation

Diverse machine translation aims at generating various target language t...
research
05/11/2023

Subword Segmental Machine Translation: Unifying Segmentation and Target Sentence Generation

Subword segmenters like BPE operate as a preprocessing step in neural ma...
research
02/27/2015

Local Translation Prediction with Global Sentence Representation

Statistical machine translation models have made great progress in impro...
research
05/31/2022

VALHALLA: Visual Hallucination for Machine Translation

Designing better machine translation systems by considering auxiliary in...
research
06/24/2021

On the Influence of Machine Translation on Language Origin Obfuscation

In the last decade, machine translation has become a popular means to de...
