Signal-Aware Direction-of-Arrival Estimation Using Attention Mechanisms

01/03/2022
by   Wolfgang Mack, et al.
0

The direction-of-arrival (DOA) of sound sources is an essential acoustic parameter used, e.g., for multi-channel speech enhancement or source tracking. Complex acoustic scenarios consisting of sources-of-interest, interfering sources, reverberation, and noise make the estimation of the DOAs corresponding to the sources-of-interest a challenging task. Recently proposed attention mechanisms allow DOA estimators to focus on the sources-of-interest and disregard interference and noise, i.e., they are signal-aware. The attention is typically obtained by a deep neural network (DNN) from a short-time Fourier transform (STFT) based representation of a single microphone signal. Subsequently, attention has been applied as binary or ratio weighting to STFT-based microphone signal representations to reduce the impact of frequency bins dominated by noise, interference, or reverberation. The impact of attention on DOA estimators and different training strategies for attention and DOA DNNs are not yet studied in depth. In this paper, we evaluate systems consisting of different DNNs and signal processing-based methods for DOA estimation when attention is applied. Additionally, we propose training strategies for attention-based DOA estimation optimized via a DOA objective, i.e., end-to-end. The evaluation of the proposed and the baseline systems is performed using data generated with simulated and measured room impulse responses under various acoustic conditions, like reverberation times, noise, and source array distances. Overall, DOA estimation using attention in combination with signal-processing methods exhibits a far lower computational complexity than a fully DNN-based system; however, it yields comparable results.

READ FULL TEXT

page 27

page 31

page 32

research
02/10/2019

Performance Advantages of Deep Neural Networks for Angle of Arrival Estimation

The problem of estimating the number of sources and their angles of arri...
research
03/18/2020

Multi-Source DOA Estimation through Pattern Recognition of the Modal Coherence of a Reverberant Soundfield

We propose a novel multi-source direction of arrival (DOA) estimation te...
research
11/25/2019

Invertible DNN-based nonlinear time-frequency transform for speech enhancement

We propose an end-to-end speech enhancement method with trainable time-f...
research
04/27/2021

dEchorate: a Calibrated Room Impulse Response Database for Echo-aware Signal Processing

This paper presents dEchorate: a new database of measured multichannel R...
research
12/04/2018

Localization and Tracking of an Acoustic Source using a Diagonal Unloading Beamforming and a Kalman Filter

We present the signal processing framework and some results for the IEEE...
research
04/21/2021

The Maximal Eigengap Estimator for Acoustic Vector-Sensor Processing

This paper introduces the maximal eigengap estimator for finding the dir...
research
03/28/2022

Multi-source wideband doa estimation method by frequency focusing and error weighting

In this paper, a new multi-source wideband direction of arrival (MSW-DOA...

Please sign up or login with your details

Forgot password? Click here to reset