A Recurrent Variational Autoencoder for Speech Enhancement

10/24/2019
by   Simon Leglaive, et al.
0

This paper presents a generative approach to speech enhancement based on a recurrent variational autoencoder (RVAE). The deep generative speech model is trained using clean speech signals only, and it is combined with a nonnegative matrix factorization noise model for speech enhancement. We propose a variational expectation-maximization algorithm where the encoder of the RVAE is fine-tuned at test time, to approximate the distribution of the latent variables given the noisy speech observations. Compared with previous approaches based on feed-forward fully-connected architectures, the proposed recurrent deep generative speech model induces a posterior temporal dynamic over the latent variables, which is shown to improve the speech enhancement results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/23/2021

Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders

Dynamical variational auto-encoders (DVAEs) are a class of deep generati...
research
08/07/2019

Audio-visual Speech Enhancement Using Conditional Variational Auto-Encoder

Variational auto-encoders (VAEs) are deep generative latent variable mod...
research
05/19/2021

Disentanglement Learning for Variational Autoencoders Applied to Audio-Visual Speech Enhancement

Recently, the standard variational autoencoder has been successfully use...
research
09/15/2023

RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function

In indoor scenes, reverberation is a crucial factor in degrading the per...
research
02/08/2019

Speech enhancement with variational autoencoders and alpha-stable distributions

This paper focuses on single-channel semi-supervised speech enhancement....
research
11/04/2020

Can We Trust Deep Speech Prior?

Recently, speech enhancement (SE) based on deep speech prior has attract...
research
05/03/2019

A Statistically Principled and Computationally Efficient Approach to Speech Enhancement using Variational Autoencoders

Recent studies have explored the use of deep generative models of speech...

Please sign up or login with your details

Forgot password? Click here to reset