Sound Event Localization based on Sound Intensity Vector Refined By DNN-Based Denoising and Source Separation

02/14/2020
by Masahiro Yasuda, et al.

We propose a direction-of-arrival (DOA) estimation method for Sound Event Localization and Detection (SELD). Direct estimation of the DOA using a deep neural network (DNN), i.e., a completely data-driven approach, achieves high accuracy. However, there is a gap in accuracy between DOA estimation for single and overlapping sources because such approaches cannot incorporate physical knowledge. Meanwhile, although physics-based approaches are less accurate than DNN-based approaches, they are robust to overlapping sources. In this study, we consider a combination of physics-based and DNN-based approaches: the sound intensity vectors (IVs) used for physics-based DOA estimation are refined by DNN-based denoising and source separation. This method enables accurate DOA estimation for both single and overlapping sources using a spherical microphone array. Experimental results show that the proposed method achieves state-of-the-art DOA estimation accuracy on an open dataset for SELD.
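To make the physics-based part concrete, here is a minimal sketch of DOA estimation from sound intensity vectors. It is an illustration under stated assumptions, not the authors' implementation: it assumes the array signals have already been converted to first-order Ambisonics channels (W, X, Y, Z) in the short-time Fourier domain, uses an idealized, unnormalized plane-wave encoding, and covers only a single source. The function names `iv_doa` and `encode_plane_wave` are hypothetical. The DNN-based denoising and source separation stages are not sketched, since the abstract gives no details about them.

```python
import numpy as np


def iv_doa(W, X, Y, Z):
    """Estimate azimuth and elevation (radians) from intensity vectors.

    W, X, Y, Z: complex STFT arrays of identical shape (freq, time).
    Assumes an encoding in which Re{conj(W) * [X, Y, Z]} points
    toward the source (an assumption of this sketch).
    """
    # Intensity vector for every time-frequency bin, shape (3, F, T).
    iv = np.real(np.conj(W)[None, ...] * np.stack([X, Y, Z]))
    # Sum over all bins to get one direction estimate.
    ix, iy, iz = iv.reshape(3, -1).sum(axis=1)
    azimuth = np.arctan2(iy, ix)
    elevation = np.arctan2(iz, np.hypot(ix, iy))
    return azimuth, elevation


def encode_plane_wave(s, azimuth, elevation):
    """Idealized first-order encoding of a single plane wave (hypothetical helper)."""
    ux = np.cos(azimuth) * np.cos(elevation)
    uy = np.sin(azimuth) * np.cos(elevation)
    uz = np.sin(elevation)
    return s, s * ux, s * uy, s * uz


# Synthetic check: one noise-free source at azimuth 60 deg, elevation 20 deg.
rng = np.random.default_rng(0)
s = rng.standard_normal((64, 20)) + 1j * rng.standard_normal((64, 20))
W, X, Y, Z = encode_plane_wave(s, np.radians(60.0), np.radians(20.0))
az, el = iv_doa(W, X, Y, Z)
az_deg, el_deg = np.degrees(az), np.degrees(el)
```

With noise, reverberation, or overlapping sources, the per-bin intensity vectors no longer point at a single source, which is the situation the DNN-based refinement described in the abstract is meant to address.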

research
10/10/2019

DOA Estimation by DNN-based Denoising and Dereverberation from Sound Intensity Vector

We propose a direction of arrival (DOA) estimation method that combines ...
research
05/01/2019

Polyphonic Sound Event Detection and Localization using a Two-Stage Strategy

Sound event detection (SED) and localization refer to recognizing sound ...
research
10/22/2020

DBNET: DOA-driven beamforming network for end-to-end farfield sound source separation

Many deep learning techniques are available to perform source separation...
research
05/06/2021

Weakly Supervised Source-Specific Sound Level Estimation in Noisy Soundscapes

While the estimation of what sound sources are, when they occur, and fro...
research
10/14/2019

Physics-Informed Deep Neural Network Method for Limited Observability State Estimation

The precise knowledge regarding the state of the power grid is important...
research
10/01/2021

SALSA: Spatial Cue-Augmented Log-Spectrogram Features for Polyphonic Sound Event Localization and Detection

Sound event localization and detection (SELD) consists of two subtasks, ...
research
08/22/2019

Sound Localization and Separation in Three-dimensional Space Using a Single Microphone with a Metamaterial Enclosure

Conventional approaches to sound localization and separation are based o...
