Text-to-speech synthesis (TTS) is a task to convert texts into speech. T...
End-to-end text-to-speech synthesis (TTS) can generate highly natural
sy...
This paper describes ESPnet2-TTS, an end-to-end text-to-speech (E2E-TTS)...
We explore pretraining strategies including choice of base corpus with t...
We have been working on speech synthesis for rakugo (a traditional Japan...
Explicit duration modeling is a key to achieving robust and efficient
al...
Neural sequence-to-sequence text-to-speech synthesis (TTS) can produce
h...
Sequence-to-sequence text-to-speech (TTS) is dominated by
soft-attention...
End-to-end text-to-speech (TTS) synthesis is a method that directly conv...
End-to-end speech synthesis is a promising approach that directly conver...