research
∙
07/04/2023
Disentanglement in a GAN for Unconditional Speech Synthesis
Can we develop a model that can synthesize realistic speech directly fro...
research
∙
05/30/2023
Voice Conversion With Just Nearest Neighbors
Any-to-any voice conversion aims to transform source speech into a targe...
research
∙
10/14/2022
TransFusion: Transcribing Speech with Multinomial Diffusion
Diffusion models have shown exceptional scaling properties in the image ...
research
∙
10/11/2022
GAN You Hear Me? Reclaiming Unconditional Speech Synthesis from Diffusion Models
We propose AudioStyleGAN (ASGAN), a new generative adversarial network (...
research
∙
11/04/2021
Voice Conversion Can Improve ASR in Very Low-Resource Settings
Voice conversion (VC) has been proposed to improve speech recognition sy...
research
∙
08/02/2021
Analyzing Speaker Information in Self-Supervised Models to Improve Zero-Resource Speech Processing
Contrastive predictive coding (CPC) aims to learn representations of spe...
research
∙
05/31/2021