Position tracking of a varying number of sound sources with sliding permutation invariant training

10/26/2022
by   David Diaz-Guerra, et al.
0

Recent data- and learning-based sound source localization (SSL) methods have shown strong performance in challenging acoustic scenarios. However, little work has been done on adapting such methods to track consistently multiple sources appearing and disappearing, as would occur in reality. In this paper, we present a new training strategy for deep learning SSL models with a straightforward implementation based on the mean squared error of the optimal association between estimated and reference positions in the preceding time frames. It optimizes the desired properties of a tracking system: handling a time-varying number of sources and ordering localization estimates according to their trajectories, minimizing identity switches (IDSs). Evaluation on simulated data of multiple reverberant moving sources and on two model architectures proves its effectiveness on reducing identity switches without compromising frame-wise localization accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/29/2019

Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network

This paper investigates the joint localization, detection, and tracking ...
research
10/29/2021

Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers

Data-based and learning-based sound source localization (SSL) has shown ...
research
06/14/2023

Permutation Invariant Recurrent Neural Networks for Sound Source Tracking Applications

Many multi-source localization and tracking models based on neural netwo...
research
09/03/2019

The LOCATA Challenge: Acoustic Source Localization and Tracking

The ability to localize and track acoustic events is a fundamental prere...
research
02/16/2022

SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization

Multiple moving sound source localization in real-world scenarios remain...
research
06/28/2023

Sequential Attention Source Identification Based on Feature Representation

Snapshot observation based source localization has been widely studied d...
research
04/09/2022

Finding the Right Place: Sensor Placement for UWB Time Difference of Arrival Localization in Cluttered Indoor Environments

Ultra-wideband (UWB) time difference of arrival (TDOA)-based localizatio...

Please sign up or login with your details

Forgot password? Click here to reset