A variance modeling framework based on variational autoencoders for speech enhancement

02/05/2019
by   Simon Leglaive, et al.
0

In this paper we address the problem of enhancing speech signals in noisy mixtures using a source separation approach. We explore the use of neural networks as an alternative to a popular speech variance model based on supervised non-negative matrix factorization (NMF). More precisely, we use a variational autoencoder as a speaker-independent supervised generative speech model, highlighting the conceptual similarities that this approach shares with its NMF-based counterpart. In order to be free of generalization issues regarding the noisy recording environments, we follow the approach of having a supervised model only for the target speech signal, the noise model being based on unsupervised NMF. We develop a Monte Carlo expectation-maximization algorithm for inferring the latent variables in the variational autoencoder and estimating the unsupervised model parameters. Experiments show that the proposed method outperforms a semi-supervised NMF baseline and a state-of-the-art fully supervised deep learning approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/16/2018

Semi-supervised multichannel speech enhancement with variational autoencoders and non-negative matrix factorization

In this paper we address speaker-independent multichannel speech enhance...
research
04/05/2019

Unsupervised Low Latency Speech Enhancement with RT-GCC-NMF

In this paper, we present RT-GCC-NMF: a real-time (RT), two-channel blin...
research
02/08/2019

Speech enhancement with variational autoencoders and alpha-stable distributions

This paper focuses on single-channel semi-supervised speech enhancement....
research
08/17/2020

Deep Variational Generative Models for Audio-visual Speech Separation

In this paper, we are interested in audio-visual speech separation given...
research
06/13/2023

Unsupervised speech enhancement with deep dynamical generative speech and noise models

This work builds on a previous work on unsupervised speech enhancement u...
research
04/07/2022

Inference over radiative transfer models using variational and expectation maximization methods

Earth observation from satellites offers the possibility to monitor our ...
research
01/10/2022

Noisy Neonatal Chest Sound Separation for High-Quality Heart and Lung Sounds

Stethoscope-recorded chest sounds provide the opportunity for remote car...

Please sign up or login with your details

Forgot password? Click here to reset