Blind and neural network-guided convolutional beamformer for joint denoising, dereverberation, and source separation

08/04/2021
by   Tomohiro Nakatani, et al.
0

This paper proposes an approach for optimizing a Convolutional BeamFormer (CBF) that can jointly perform denoising (DN), dereverberation (DR), and source separation (SS). First, we develop a blind CBF optimization algorithm that requires no prior information on the sources or the room acoustics, by extending a conventional joint DR and SS method. For making the optimization computationally tractable, we incorporate two techniques into the approach: the Source-Wise Factorization (SW-Fact) of a CBF and the Independent Vector Extraction (IVE). To further improve the performance, we develop a method that integrates a neural network(NN) based source power spectra estimation with CBF optimization by an inverse-Gamma prior. Experiments using noisy reverberant mixtures reveal that our proposed method with both blind and NN-guided scenarios greatly outperforms the conventional state-of-the-art NN-supported mask-based CBF in terms of the improvement in automatic speech recognition and signal distortion reduction performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/20/2020

Jointly optimal denoising, dereverberation, and source separation

This paper proposes methods that can optimize a Convolutional BeamFormer...
research
11/20/2021

Switching Independent Vector Analysis and Its Extension to Blind and Spatially Guided Convolutional Beamforming Algorithm

This paper develops a framework that can perform denoising, dereverberat...
research
02/09/2023

Joint Acoustic Echo Cancellation and Speech Dereverberation Using Kalman filters

This paper proposes a joint acoustic echo cancellation (AEC) and speech ...
research
09/04/2020

Lorentzian Peak Sharpening and Sparse Blind Source Separation for NMR Spectroscopy

In this paper, we introduce a preprocessing technique for blind source s...
research
08/01/2020

Efficient Independent Vector Extraction of Dominant Target Speech

The complete decomposition performed by blind source separation is compu...
research
03/31/2022

Perceptive, non-linear Speech Processing and Spiking Neural Networks

Source separation and speech recognition are very difficult in the conte...
research
11/05/2021

Blind Extraction of Target Speech Source Guided by Supervised Speaker Identification via X-vectors

This manuscript proposes a novel robust procedure for extraction of a sp...

Please sign up or login with your details

Forgot password? Click here to reset