Deep Neural Networks for Multiple Speaker Detection and Localization

11/30/2017
by   Weipeng He, et al.
0

We propose to use neural networks (NNs) for simultaneous detection and localization of multiple sound sources in Human-Robot Interaction (HRI). Unlike conventional signal processing techniques, NN-based Sound Source Localization (SSL) methods are relatively straightforward and require no or fewer assumptions that hardly hold in real HRI scenarios. Previously, NN-based methods have been successfully applied to single SSL problems, which do not extend to multiple sources in terms of detection and localization. In this paper, we thus propose a likelihood-based encoding of the network output, which naturally allows the detection of an arbitrary number of sources. In addition, we investigate the use of sub-band cross-correlation information as features for better localization in sound mixtures, as well as three different NN architectures based on different processing motivations. Experiments on real data recorded from the robot show that our NN-based methods significantly outperform the popular spatial spectrum-based approaches.

READ FULL TEXT

page 1

page 3

page 4

research
01/20/2023

Adjoint-Based Identification of Sound Sources for Sound Reinforcement and Source Localization

The identification of sound sources is a common problem in acoustics. Di...
research
08/08/2023

Dual input neural networks for positional sound source localization

In many signal processing applications, metadata may be advantageously u...
research
10/29/2020

ACCDOA: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization and Detection

Neural-network (NN)-based methods show high performance in sound event l...
research
12/01/2018

Lightweight and Optimized Sound Source Localization and Tracking Methods for Open and Closed Microphone Array Configurations

Human-robot interaction in natural settings requires filtering out the d...
research
06/24/2022

Iterative Sound Source Localization for Unknown Number of Sources

Sound source localization aims to seek the direction of arrival (DOA) of...
research
10/29/2021

Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers

Data-based and learning-based sound source localization (SSL) has shown ...
research
02/16/2022

SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization

Multiple moving sound source localization in real-world scenarios remain...

Please sign up or login with your details

Forgot password? Click here to reset