Text Style Transfer Back-Translation

06/02/2023
by Daimeng Wei, et al.

Back Translation (BT) is widely used in machine translation because it has been shown to improve translation quality. However, BT mainly improves the translation of inputs that share a similar style (more specifically, translation-like inputs), since the source side of BT data is machine-translated. For natural inputs, BT brings only slight improvements and sometimes even has adverse effects. To address this issue, we propose Text Style Transfer Back Translation (TST BT), which uses a style transfer model to modify the source side of BT data. By making the style of the source-side text more natural, we aim to improve the translation of natural inputs. Our experiments on various language pairs, including both high-resource and low-resource ones, demonstrate that TST BT significantly improves translation performance against popular BT benchmarks. In addition, TST BT proves effective in domain adaptation, so the strategy can be regarded as a general data augmentation method. Our training code and text style transfer model are open-sourced.
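The data flow described in the abstract can be illustrated with a minimal sketch. This is not the authors' released code: the function name `build_tst_bt_pairs` and the two callables `back_translate` and `style_transfer` are hypothetical placeholders, and the toy stand-ins below only tag strings so that the pipeline can run without any trained model. The only thing the sketch assumes from the abstract is the order of operations: monolingual target text is back-translated into a translation-like source, and that source is then rewritten by a style transfer model before being paired with the original target.

```python
from typing import Callable, Iterable, List, Tuple


def build_tst_bt_pairs(
    target_sentences: Iterable[str],
    back_translate: Callable[[str], str],
    style_transfer: Callable[[str], str],
) -> List[Tuple[str, str]]:
    """Build synthetic (source, target) pairs in the TST BT order of operations.

    Both callables are hypothetical stand-ins for trained models.
    """
    pairs = []
    for tgt in target_sentences:
        # Step 1 (plain BT): target -> machine-translated, translation-like source.
        synthetic_src = back_translate(tgt)
        # Step 2 (TST): rewrite the synthetic source toward a more natural style.
        natural_src = style_transfer(synthetic_src)
        # The target side is left untouched; only the source side changes.
        pairs.append((natural_src, tgt))
    return pairs


# Toy stand-ins (assumptions for illustration only, not real models).
def toy_back_translate(sentence: str) -> str:
    return "<mt> " + sentence.lower()


def toy_style_transfer(sentence: str) -> str:
    return sentence.replace("<mt> ", "<nat> ")


pairs = build_tst_bt_pairs(["Hello World"], toy_back_translate, toy_style_transfer)
print(pairs)
```

In a real setup the resulting pairs would be mixed with genuine parallel data to train the forward translation model; the mixing ratio and training details are not stated in the abstract and are left out here.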

research
11/06/2020

Semi-Supervised Low-Resource Style Transfer of Indonesian Informal to Formal Language with Iterative Forward-Translation

In its daily use, the Indonesian language is riddled with informality, t...
research
05/09/2022

So Different Yet So Alike! Constrained Unsupervised Text Style Transfer

Automatic transfer of text between domains has become popular in recent ...
research
08/23/2018

Style Transfer as Unsupervised Machine Translation

Language style transferring rephrases text with specific stylistic attri...
research
11/13/2017

Zero-Shot Style Transfer in Text Using Recurrent Neural Networks

Zero-shot translation is the task of translating between a language pair...
research
04/10/2023

ITportrait: Image-Text Coupled 3D Portrait Domain Adaptation

Domain adaptation of 3D portraits has gained more and more attention. Ho...
research
11/17/2022

Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation

Conversion of Chinese Grapheme-to-Phoneme (G2P) plays an important role ...
research
09/16/2023

Enhancing Visual Perception in Novel Environments via Incremental Data Augmentation Based on Style Transfer

The deployment of autonomous agents in real-world scenarios is challenge...
