RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function

09/15/2023
by   Pengyu Wang, et al.
0

In indoor scenes, reverberation is a crucial factor in degrading the perceived quality and intelligibility of speech. In this work, we propose a generative dereverberation method. Our approach is based on a probabilistic model utilizing a recurrent variational auto-encoder (RVAE) network and the convolutive transfer function (CTF) approximation. Different from most previous approaches, the output of our RVAE serves as the prior of the clean speech. And our target is the maximum a posteriori (MAP) estimation of clean speech, which is achieved iteratively through the expectation maximization (EM) algorithm. The proposed method integrates the capabilities of network-based speech prior modelling and CTF-based observation modelling. Experiments on single-channel speech dereverberation show that the proposed generative method noticeably outperforms the advanced discriminative networks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/17/2020

Deep Variational Generative Models for Audio-visual Speech Separation

In this paper, we are interested in audio-visual speech separation given...
research
10/24/2019

A Recurrent Variational Autoencoder for Speech Enhancement

This paper presents a generative approach to speech enhancement based on...
research
11/04/2020

Can We Trust Deep Speech Prior?

Recently, speech enhancement (SE) based on deep speech prior has attract...
research
02/13/2018

Tighter Variational Bounds are Not Necessarily Better

We provide theoretical and empirical evidence that using tighter evidenc...
research
04/17/2019

Effective Estimation of Deep Generative Language Models

Advances in variational inference enable parameterisation of probabilist...
research
08/20/2019

Learning document embeddings along with their uncertainties

Majority of the text modelling techniques yield only point estimates of ...
research
02/24/2019

Iterative Channel Estimation for Discrete Denoising under Channel Uncertainty

We propose a novel iterative channel estimation (ICE) algorithm that ess...

Please sign up or login with your details

Forgot password? Click here to reset