Jointly optimal denoising, dereverberation, and source separation

05/20/2020
by   Tomohiro Nakatani, et al.
0

This paper proposes methods that can optimize a Convolutional BeamFormer (CBF) for performing denoising, dereverberation, and source separation (DN+DR+SS) at the same time. Conventionally, cascade configuration composed of a Weighted Prediction Error minimization (WPE) dereverberation filter followed by a Minimum Variance Distortionless Response (MVDR) beamformer has been used as the state-of-the-art frontend of far-field speech recognition, however, overall optimality of this approach is not guaranteed. In the blind signal processing area, an approach for jointly optimizing dereverberation and source separation (DR+SS) has been proposed, however, this approach requires huge computing cost, and has not been extended for application to DN+DR+SS. To overcome the above limitations, this paper develops new approaches for optimizing DN+DR+SS in a computationally much more efficient way. To this end, we introduce two different techniques for factorizing a CBF into WPE filters and beamformers, one based on extension of the conventional joint optimization approach proposed for DR+SS and the other based on a novel factorization technique, and derive methods optimizing them for DN+DR+SS based on the maximum likelihood estimation using a neural network-supported steering vector estimation. Experiments using noisy reverberant sound mixtures show that the proposed optimization approaches greatly improve the performance of the speech enhancement in comparison with the conventional cascade configuration in terms of the signal distortion measures and ASR performance. It is also shown that the proposed approaches can greatly reduce the computing cost with improved estimation accuracy in comparison with the conventional joint optimization approach.

READ FULL TEXT
research
08/04/2021

Blind and neural network-guided convolutional beamformer for joint denoising, dereverberation, and source separation

This paper proposes an approach for optimizing a Convolutional BeamForme...
research
12/20/2018

A unified convolutional beamformer for simultaneous denoising and dereverberation

This paper proposes a method for estimating a convolutional beamformer t...
research
11/20/2021

Switching Independent Vector Analysis and Its Extension to Blind and Spatially Guided Convolutional Beamforming Algorithm

This paper develops a framework that can perform denoising, dereverberat...
research
10/30/2019

Jointly optimal dereverberation and beamforming

We previously proposed an optimal (in the maximum likelihood sense) conv...
research
02/09/2023

Joint Acoustic Echo Cancellation and Speech Dereverberation Using Kalman filters

This paper proposes a joint acoustic echo cancellation (AEC) and speech ...
research
10/18/2021

Similarity-and-Independence-Aware Beamformer with Iterative Casting and Boost Start for Target Source Extraction Using Reference

Target source extraction is significant for improving human speech intel...
research
03/09/2020

Tackling real noisy reverberant meetings with all-neural source separation, counting, and diarization system

Automatic meeting analysis is an essential fundamental technology requir...

Please sign up or login with your details

Forgot password? Click here to reset