SELD-TCN: Sound Event Localization & Detection via Temporal Convolutional Networks

03/03/2020
by   Karim Guirguis, et al.

Understanding the surrounding environment plays a critical role in autonomous robotic systems, such as self-driving cars. Extensive research has been carried out on visual perception. Yet, to obtain a more complete perception of the environment, autonomous systems of the future should also take acoustic information into account. Recent sound event localization and detection (SELD) frameworks utilize convolutional recurrent neural networks (CRNNs). However, the recurrent nature of CRNNs makes them challenging to implement efficiently on embedded hardware. Not only are their computations difficult to parallelize, but they also require high memory bandwidth and large memory buffers. In this work, we develop a novel architecture based on a temporal convolutional network (TCN) that is more robust and hardware-friendly. The proposed framework (SELD-TCN) outperforms the state-of-the-art SELDnet on four different datasets. Moreover, SELD-TCN achieves 4x faster training time per epoch and 40x faster inference time on an ordinary graphics processing unit (GPU).
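To illustrate why a convolutional temporal model is easier to parallelize than a recurrent one, here is a minimal sketch of a dilated 1-D convolution in plain Python. It is not the SELD-TCN block itself: the causal zero-padding, the function names `dilated_conv1d` and `receptive_field`, and the kernel size and dilation values are all assumptions made for illustration. The point is that each output step depends only on the inputs, never on a previous output, so all time steps can be computed independently.

```python
def dilated_conv1d(x, weights, dilation):
    """Causal dilated 1-D convolution over a list of floats (illustrative only).

    y[t] = sum_k weights[k] * x[t - k * dilation], treating out-of-range
    inputs as zero. No y[t] depends on another y[t'], unlike a recurrence.
    """
    out = []
    for t in range(len(x)):  # every iteration is independent of the others
        acc = 0.0
        for k, w in enumerate(weights):
            idx = t - k * dilation
            if idx >= 0:
                acc += w * x[idx]
        out.append(acc)
    return out


def receptive_field(kernel_size, dilations):
    """Input steps visible to one output step after stacking dilated layers."""
    return 1 + sum((kernel_size - 1) * d for d in dilations)


if __name__ == "__main__":
    signal = [1.0, 2.0, 3.0, 4.0, 5.0]
    # With weights [1, 1] and dilation 2, each output is x[t] + x[t - 2].
    print(dilated_conv1d(signal, [1.0, 1.0], dilation=2))
    # Hypothetical stack of four layers with doubling dilation.
    print(receptive_field(kernel_size=2, dilations=[1, 2, 4, 8]))
```

Stacking layers with growing dilation widens the temporal context without any step-by-step state, which is the property that avoids the sequential dependency of a recurrent layer.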


research
06/07/2021

PILOT: Introducing Transformers for Probabilistic Sound Event Localization

Sound event localization aims at estimating the positions of sound sourc...
research
09/26/2022

Impact of temporal resolution on convolutional recurrent networks for audio tagging and sound event detection

Many state-of-the-art systems for audio tagging and sound event detectio...
research
02/01/2023

EfficientRep: An Efficient Repvgg-style ConvNets with Hardware-aware Neural Network Design

We present a hardware-efficient architecture of convolutional neural net...
research
07/30/2021

TASK3 DCASE2021 Challenge: Sound event localization and detection using squeeze-excitation residual CNNs

Sound event localisation and detection (SELD) is a problem in the field ...
research
10/18/2022

Optimizing Temporal Resolution Of Convolutional Recurrent Neural Networks For Sound Event Detection

In this technical report, the systems we submitted for subtask 4 of the ...
research
07/16/2019

Separable Convolutional LSTMs for Faster Video Segmentation

Semantic Segmentation is an important module for autonomous robots such ...
research
03/29/2019

Deep, spatially coherent Inverse Sensor Models with Uncertainty Incorporation using the evidential Framework

To perform high speed tasks, sensors of autonomous cars have to provide ...
