Generative adversarial network (GAN)-based vocoders have been intensivel...
Taking long-term spectral and temporal dependencies into account is esse...
The emergence of various notions of “consistency” in diffusion models ha...
Generative adversarial networks (GANs) learn a target probability
distri...
Pre-trained diffusion models have been successfully used as priors in a
...
Removing reverb from reverberant music is a necessary technique to clean...
Although deep neural network (DNN)-based speech enhancement (SE) methods...
Score-based generative models learn a family of noise-conditional score
...
One noted issue of vector-quantized variational autoencoder (VQ-VAE) is ...
Variational autoencoders (VAEs) often suffer from posterior collapse, wh...