A New GAN-based End-to-End TTS Training Algorithm

04/09/2019
by   Haohan Guo, et al.
0

End-to-end, autoregressive model-based TTS has shown significant performance improvements over the conventional one. However, the autoregressive module training is affected by the exposure bias, or the mismatch between the different distributions of real and predicted data. While real data is available in training, but in testing, only predicted data is available to feed the autoregressive module. By introducing both real and generated data sequences in training, we can alleviate the effects of the exposure bias. We propose to use Generative Adversarial Network (GAN) along with the key idea of Professor Forcing in training. A discriminator in GAN is jointly trained to equalize the difference between real and predicted data. In AB subjective listening test, the results show that the new approach is preferred over the standard transfer learning with a CMOS improvement of 0.1. Sentence level intelligibility tests show significant improvement in a pathological test set. The GAN-trained new model is also more stable than the baseline to produce better alignments for the Tacotron output.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

research
09/16/2023

Music Generation based on Generative Adversarial Networks with Transformer

Autoregressive models based on Transformers have become the prevailing a...
research
11/28/2017

Restricting Greed in Training of Generative Adversarial Network

Generative adversarial network (GAN) has gotten wide re-search interest ...
research
02/07/2021

HGAN: Hybrid Generative Adversarial Network

In this paper, we present a simple approach to train Generative Adversar...
research
07/20/2020

Incorporating Reinforced Adversarial Learning in Autoregressive Image Generation

Autoregressive models recently achieved comparable results versus state-...
research
11/07/2019

Teacher-Student Training for Robust Tacotron-based TTS

While neural end-to-end text-to-speech (TTS) is superior to conventional...
research
05/30/2019

Adversarial Sub-sequence for Text Generation

Generative adversarial nets (GAN) has been successfully introduced for g...

Please sign up or login with your details

Forgot password? Click here to reset