SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization

02/16/2022
by   Bing Yang, et al.
0

Multiple moving sound source localization in real-world scenarios remains a challenging issue due to interaction between sources, time-varying trajectories, distorted spatial cues, etc. In this work, we propose to use deep learning techniques to learn competing and time-varying direct-path phase differences for localizing multiple moving sound sources. A causal convolutional recurrent neural network is designed to extract the direct-path phase difference sequence from signals of each microphone pair. To avoid the assignment ambiguity and the problem of uncertain output-dimension encountered when simultaneously predicting multiple targets, the learning target is designed in a weighted sum format, which encodes source activity in the weight and direct-path phase differences in the summed value. The learned direct-path phase differences for all microphone pairs can be directly used to construct the spatial spectrum according to the formulation of steered response power (SRP). This deep neural network (DNN) based SRP method is referred to as SRP-DNN. The locations of sources are estimated by iteratively detecting and removing the dominant source from the spatial spectrum, in which way the interaction between sources is reduced. Experimental results on both simulated and real-world data show the superiority of the proposed method in the presence of noise and reverberation.

READ FULL TEXT
research
02/16/2022

Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization

Direct-path relative transfer function (DP-RTF) refers to the ratio betw...
research
10/10/2021

Direct source and early reflections localization using deep deconvolution network under reverberant environment

This paper proposes a deconvolution-based network (DCNN) model for DOA e...
research
04/29/2019

Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network

This paper investigates the joint localization, detection, and tracking ...
research
10/26/2022

Position tracking of a varying number of sound sources with sliding permutation invariant training

Recent data- and learning-based sound source localization (SSL) methods ...
research
12/07/2020

Reverberant Sound Localization with a Robot Head Based on Direct-Path Relative Transfer Function

This paper addresses the problem of sound-source localization (SSL) with...
research
11/30/2017

Deep Neural Networks for Multiple Speaker Detection and Localization

We propose to use neural networks (NNs) for simultaneous detection and l...
research
04/05/2019

Robust Binaural Localization of a Target Sound Source by Combining Spectral Source Models and Deep Neural Networks

Despite there being clear evidence for top-down (e.g., attentional) effe...

Please sign up or login with your details

Forgot password? Click here to reset