A DNN based Normalized Time-frequency Weighted Criterion for Robust Wideband DoA Estimation

02/20/2023
by   Kuan-Lin Chen, et al.
0

Deep neural networks (DNNs) have greatly benefited direction of arrival (DoA) estimation methods for speech source localization in noisy environments. However, their localization accuracy is still far from satisfactory due to the vulnerability to nonspeech interference. To improve the robustness against interference, we propose a DNN based normalized time-frequency (T-F) weighted criterion which minimizes the distance between the candidate steering vectors and the filtered snapshots in the T-F domain. Our method requires no eigendecomposition and uses a simple normalization to prevent the optimization objective from being misled by noisy filtered snapshots. We also study different designs of T-F weights guided by a DNN. We find that duplicating the Hadamard product of speech ratio masks is highly effective and better than other techniques such as direct masking and taking the mean in the proposed approach. However, the best-performing design of T-F weights is criterion-dependent in general. Experiments show that the proposed method outperforms popular DNN based DoA estimation methods including widely used subspace methods in noisy and reverberant environments.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/09/2014

Spatial Diffuseness Features for DNN-Based Speech Recognition in Noisy and Reverberant Environments

We propose a spatial diffuseness feature for deep neural network (DNN)-b...
research
04/14/2019

A robust DOA estimation method for a linear microphone array under reverberant and noisy environments

A robust method for linear array is proposed to address the difficulty o...
research
06/29/2021

DCASE 2021 Task 3: Spectrotemporally-aligned Features for Polyphonic Sound Event Localization and Detection

Sound event localization and detection consists of two subtasks which ar...
research
02/23/2023

Frequency bin-wise single channel speech presence probability estimation using multiple DNNs

In this work, we propose a frequency bin-wise method to estimate the sin...
research
09/18/2023

Refining DNN-based Mask Estimation using CGMM-based EM Algorithm for Multi-channel Noise Reduction

In this paper, we present a method that allows to further improve speech...
research
01/28/2020

Subband Weighting for Binaural Speech Source Localization

We consider the task of speech source localization from a bin-aural reco...
research
03/24/2022

Repairing Group-Level Errors for DNNs Using Weighted Regularization

Deep Neural Networks (DNNs) have been widely used in software making dec...

Please sign up or login with your details

Forgot password? Click here to reset