Expediting TTS Synthesis with Adversarial Vocoding

04/16/2019
by   Paarth Neekhara, et al.
0

Recent approaches in text-to-speech (TTS) synthesis employ neural network strategies to vocode perceptually-informed spectrogram representations directly into listenable waveforms. Such vocoding procedures create a computational bottleneck in modern TTS pipelines. We propose an alternative approach which utilizes generative adversarial networks (GANs) to learn mappings from perceptually-informed spectrograms to simple magnitude spectrograms which can be heuristically vocoded. Through a user study, we show that our approach significantly outperforms naïve vocoding strategies while being hundreds of times faster than neural network vocoders used in state-of-the-art TTS systems. We also show that our method can be used to achieve state-of-the-art results in unsupervised synthesis of individual words of speech.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/25/2018

Reducing over-smoothness in speech synthesis using Generative Adversarial Networks

Speech synthesis is widely used in many practical applications. In recen...
research
10/06/2021

GANtron: Emotional Speech Synthesis with Generative Adversarial Networks

Speech synthesis is used in a wide variety of industries. Nonetheless, i...
research
03/12/2021

Signal Representations for Synthesizing Audio Textures with Generative Adversarial Networks

Generative Adversarial Networks (GANs) currently achieve the state-of-th...
research
03/14/2019

Generative adversarial network-based glottal waveform model for statistical parametric speech synthesis

Recent studies have shown that text-to-speech synthesis quality can be i...
research
04/17/2021

Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement

The intelligibility of speech severely degrades in the presence of envir...
research
01/23/2023

StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis

Text-to-image synthesis has recently seen significant progress thanks to...
research
07/20/2023

Efficient Beam Tree Recursion

Beam Tree Recursive Neural Network (BT-RvNN) was recently proposed as a ...

Please sign up or login with your details

Forgot password? Click here to reset