Diffusion Posterior Sampling for Informed Single-Channel Dereverberation

06/21/2023
by   Jean-Marie Lemercier, et al.
0

We present in this paper an informed single-channel dereverberation method based on conditional generation with diffusion models. With knowledge of the room impulse response, the anechoic utterance is generated via reverse diffusion using a measurement consistency criterion coupled with a neural network that represents the clean speech prior. The proposed approach is largely more robust to measurement noise compared to a state-of-the-art informed single-channel dereverberation method, especially for non-stationary noise. Furthermore, we compare to other blind dereverberation methods using diffusion models and show superiority of the proposed approach for large reverberation times. We motivate the presented algorithm by introducing an extension for blind dereverberation allowing joint estimation of the room impulse response and anechoic speech. Audio samples and code can be found online (https://uhh.de/inf-sp-derev-dps).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/09/2019

Impulse Response Data Augmentation and Deep Neural Networks for Blind Room Acoustic Parameter Estimation

The reverberation time (T60) and the direct-to-reverberant ratio (DRR) a...
research
09/18/2023

Single and Few-step Diffusion for Generative Speech Enhancement

Diffusion models have shown promising results in speech enhancement, usi...
research
08/11/2022

Speech Enhancement and Dereverberation with Diffusion-based Generative Models

Recently, diffusion-based generative models have been introduced to the ...
research
07/29/2021

Blind Room Parameter Estimation Using Multiple-Multichannel Speech Recordings

Knowing the geometrical and acoustical parameters of a room may benefit ...
research
06/22/2023

Wind Noise Reduction with a Diffusion-based Stochastic Regeneration Model

In this paper we present a method for single-channel wind noise reductio...
research
02/13/2021

Multi-Channel Speech Enhancement using Graph Neural Networks

Multi-channel speech enhancement aims to extract clean speech from a noi...
research
06/02/2023

Zero-Shot Blind Audio Bandwidth Extension

Audio bandwidth extension involves the realistic reconstruction of high-...

Please sign up or login with your details

Forgot password? Click here to reset