A unified convolutional beamformer for simultaneous denoising and dereverberation

12/20/2018
by   Tomohiro Nakatani, et al.
0

This paper proposes a method for estimating a convolutional beamformer that can perform denoising and dereverberation simultaneously in an optimal way. The application of dereverberation based on weighted prediction error (WPE) followed by denoising based on a minimum variance distortionless response beamformer (MVDR) has conventionally been considered a promising approach, however, the optimality of this approach is not guaranteed. To realize the optimal integration of denoising and dereverberation, we present a method that unifies WPE and a variant of MVDR, namely a minimum power distortionless response beamformer (MPDR), into a single convolutional beamformer, and we optimize it based on a single unified optimization criterion. The proposed beamformer is referred to as a Weighted Power minimization Distortionless response beamformer (WPD). Experiments show that the proposed method substantially improves the speech enhancement performance in terms of both objective speech enhancement measures and automatic speech recognition (ASR).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/06/2019

Maximum likelihood convolutional beamformer for simultaneous denoising and dereverberation

This article describes a probabilistic formulation of a Weighted Power m...
research
10/19/2022

End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation

Self-supervised learning representation (SSLR) has demonstrated its sign...
research
05/20/2020

Jointly optimal denoising, dereverberation, and source separation

This paper proposes methods that can optimize a Convolutional BeamFormer...
research
06/03/2021

Joint Multi-Channel Dereverberation and Noise Reduction Using a Unified Convolutional Beamformer With Sparse Priors

Recently, the convolutional weighted power minimization distortionless r...
research
10/30/2019

Jointly optimal dereverberation and beamforming

We previously proposed an optimal (in the maximum likelihood sense) conv...
research
04/19/2019

Dry, Focus, and Transcribe: End-to-End Integration of Dereverberation, Beamforming, and ASR

Sequence-to-sequence (S2S) modeling is becoming a popular paradigm for a...
research
02/23/2023

A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement

In this study, we present an approach to train a single speech enhanceme...

Please sign up or login with your details

Forgot password? Click here to reset