We work to create a multilingual speech synthesis system which can gener...
Animating portraits using speech has received growing attention in recen...
Despite recent advances in generative modeling for text-to-speech synthe...
Speech-to-text alignment is a critical component of neural textto-speech...
generative network for text-to-speech synthesis with control over speec...
We propose a novel approach for image segmentation that combines Neural
...
Mellotron is a multispeaker voice synthesis model based on Tacotron 2 GS...
In this paper we propose WaveGlow: a flow-based network capable of gener...
This paper describes computational methods for the visual display and
an...
In this paper we show strategies to easily identify fake samples generat...
In this paper we investigate the ability of generative adversarial netwo...
The paper approaches the problem of image-to-text with attention-based
e...
This paper compares methods for imputing missing categorical data for
su...