Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers

10/29/2021
by   Sharath Adavanne, et al.
0

Data-based and learning-based sound source localization (SSL) has shown promising results in challenging conditions, and is commonly set as a classification or a regression problem. Regression-based approaches have certain advantages over classification-based, such as continuous direction-of-arrival estimation of static and moving sources. However, multi-source scenarios require multiple regressors without a clear training strategy up-to-date, that does not rely on auxiliary information such as simultaneous sound classification. We investigate end-to-end training of such methods with a technique recently proposed for video object detectors, adapted to the SSL setting. A differentiable network is constructed that can be plugged to the output of the localizer to solve the optimal assignment between predictions and references, optimizing directly the popular CLEAR-MOT tracking metrics. Results indicate large improvements over directly optimizing mean squared errors, in terms of localization error, detection metrics, and tracking capabilities.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/26/2022

Position tracking of a varying number of sound sources with sliding permutation invariant training

Recent data- and learning-based sound source localization (SSL) methods ...
research
09/08/2021

A Survey of Sound Source Localization with Deep Learning Methods

This article is a survey on deep learning methods for single and multipl...
research
12/10/2020

Learning Multiple Sound Source 2D Localization

In this paper, we propose novel deep learning based algorithms for multi...
research
09/17/2023

Sound Source Distance Estimation in Diverse and Dynamic Acoustic Conditions

Localizing a moving sound source in the real world involves determining ...
research
11/30/2017

Deep Neural Networks for Multiple Speaker Detection and Localization

We propose to use neural networks (NNs) for simultaneous detection and l...
research
11/30/2022

How to (virtually) train your sound source localizer

Learning-based methods have become ubiquitous in sound source localizati...

Please sign up or login with your details

Forgot password? Click here to reset