Dual input neural networks for positional sound source localization

08/08/2023
by   Eric Grinstein, et al.
0

In many signal processing applications, metadata may be advantageously used in conjunction with a high dimensional signal to produce a desired output. In the case of classical Sound Source Localization (SSL) algorithms, information from a high dimensional, multichannel audio signals received by many distributed microphones is combined with information describing acoustic properties of the scene, such as the microphones' coordinates in space, to estimate the position of a sound source. We introduce Dual Input Neural Networks (DI-NNs) as a simple and effective way to model these two data types in a neural network. We train and evaluate our proposed DI-NN on scenarios of varying difficulty and realism and compare it against an alternative architecture, a classical Least-Squares (LS) method as well as a classical Convolutional Recurrent Neural Network (CRNN). Our results show that the DI-NN significantly outperforms the baselines, achieving a five times lower localization error than the LS method and two times lower than the CRNN in a test dataset of real recordings.

READ FULL TEXT
research
06/28/2023

Graph neural networks for sound source localization on distributed microphone networks

Distributed Microphone Arrays (DMAs) present many challenges with respec...
research
11/30/2017

Deep Neural Networks for Multiple Speaker Detection and Localization

We propose to use neural networks (NNs) for simultaneous detection and l...
research
07/29/2018

Towards End-to-End Acoustic Localization using Deep Learning: from Audio Signal to Source Position Coordinates

This paper presents a novel approach for indoor acoustic source localiza...
research
12/09/2012

High-dimensional sequence transduction

We investigate the problem of transforming an input sequence into a high...
research
06/01/2021

Dual Normalization Multitasking for Audio-Visual Sounding Object Localization

Although several research works have been reported on audio-visual sound...
research
09/08/2021

A Survey of Sound Source Localization with Deep Learning Methods

This article is a survey on deep learning methods for single and multipl...
research
10/22/2022

Neural Sound Field Decomposition with Super-resolution of Sound Direction

Sound field decomposition predicts waveforms in arbitrary directions usi...

Please sign up or login with your details

Forgot password? Click here to reset