State-of-the-art text-to-speech (TTS) systems have utilized pretrained
l...
We present eCat, a novel end-to-end multispeaker model capable of: a)
ge...
Generating expressive and contextually appropriate prosody remains a
cha...
Duration modelling has become an important research problem once more wi...
In this paper, we present CopyCat2 (CC2), a novel model capable of: a)
s...
We propose a novel Multi-Scale Spectrogram (MSS) modelling approach to
s...
Many factors influence speech yielding different renditions of a given
s...
In this paper, we introduce Kathaka, a model trained with a novel two-st...
Prosody Transfer (PT) is a technique that aims to use the prosody from a...