Neural vocoders based on the generative adversarial neural network (GAN)...
The recently developed pitch-controllable text-to-speech (TTS) model, i....
This paper proposes a hierarchical and multi-scale variational
autoencod...
Recently, it has become easier to obtain speech data from various media ...
This paper proposes a controllable end-to-end text-to-speech (TTS) syste...