TASK3 DCASE2021 Challenge: Sound event localization and detection using squeeze-excitation residual CNNs

07/30/2021
by   Javier Naranjo-Alcazar, et al.
0

Sound event localisation and detection (SELD) is a problem in the field of automatic listening that aims at the temporal detection and localisation (direction of arrival estimation) of sound events within an audio clip, usually of long duration. Due to the amount of data present in the datasets related to this problem, solutions based on deep learning have positioned themselves at the top of the state of the art. Most solutions are based on 2D representations of the audio (different spectrograms) that are processed by a convolutional-recurrent network. The motivation of this submission is to study the squeeze-excitation technique in the convolutional part of the network and how it improves the performance of the system. This study is based on the one carried out by the same team last year. This year, it has been decided to study how this technique improves each of the datasets (last year only the MIC dataset was studied). This modification shows an improvement in the performance of the system compared to the baseline using MIC dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/25/2020

Sound Event Localization and Detection using Squeeze-Excitation Residual CNNs

Sound Event Localization and Detection (SELD) is a problem related to th...
research
10/22/2019

Sound Event Localization and Detection Using CRNN on Pairs of Microphones

This paper proposes sound event localization and detection methods from ...
research
02/28/2021

Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization

Sound event localization frameworks based on deep neural networks have s...
research
08/27/2019

A hybrid parametric-deep learning approach for sound event localization and detection

This work describes and discusses an algorithm submitted to the Sound Ev...
research
03/03/2020

SELD-TCN: Sound Event Localization Detection via Temporal Convolutional Networks

The understanding of the surrounding environment plays a critical role i...
research
09/26/2022

Impact of temporal resolution on convolutional recurrent networks for audio tagging and sound event detection

Many state-of-the-art systems for audio tagging and sound event detectio...
research
06/27/2023

MAE-GEBD:Winning the CVPR'2023 LOVEU-GEBD Challenge

The Generic Event Boundary Detection (GEBD) task aims to build a model f...

Please sign up or login with your details

Forgot password? Click here to reset