This paper describes Tacotron 2, a neural network architecture for speec...
A text-to-speech synthesis system typically consists of multiple stages,...
Developers of text-to-speech (TTS) synthesizers often make use of human
...
Acoustic models based on long short-term memory recurrent neural network...