Sound Event Localization and Detection Using Activity-Coupled Cartesian DOA Vector and RD3net

06/22/2020
by Kazuki Shimada, et al.

This report describes our systems submitted to DCASE2020 Task 3: Sound Event Localization and Detection (SELD). We consider two systems: a single-stage system that solves sound event localization (SEL) and sound event detection (SED) simultaneously, and a two-stage system that first handles the SED and SEL tasks individually and later combines their results. For the single-stage system, we propose a unified training framework that uses an activity-coupled Cartesian DOA vector (ACCDOA) representation as a single target for both the SED and SEL tasks. To efficiently estimate sound event locations and activities, we further propose RD3Net, which incorporates recurrent and convolutional layers with dense skip connections and dilation. To generalize the models, we apply three data augmentation techniques: equalized mixture data augmentation (EMDA), rotation of first-order Ambisonic (FOA) signals, and a multichannel extension of SpecAugment. Our systems demonstrate a significant improvement over the baseline system.
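To make the ACCDOA idea concrete, here is a minimal NumPy sketch of how such a target could be built and read back. It is not the authors' code. It assumes that each (frame, class) pair gets a 3-D Cartesian vector whose direction is the DOA and whose length is the event activity (0 or 1), that azimuth and elevation are given in degrees, and that an event is declared active when the vector norm exceeds 0.5. The function names, the axis convention, and the threshold are illustrative assumptions.

```python
import numpy as np


def encode_accdoa(activity, azimuth_deg, elevation_deg):
    """Build ACCDOA-style targets (hypothetical helper).

    activity, azimuth_deg, elevation_deg: arrays of shape (frames, classes).
    Returns an array of shape (frames, classes, 3).
    """
    az = np.deg2rad(azimuth_deg)
    el = np.deg2rad(elevation_deg)
    # Unit vector pointing at the source (assumed axis convention).
    unit = np.stack(
        [np.cos(el) * np.cos(az), np.cos(el) * np.sin(az), np.sin(el)],
        axis=-1,
    )
    # Couple activity and direction: inactive classes collapse to the zero vector.
    return activity[..., None] * unit


def decode_accdoa(vectors, threshold=0.5):
    """Recover activity and DOA from ACCDOA-style vectors (hypothetical helper).

    The threshold value is an assumption, not taken from the report.
    """
    norm = np.linalg.norm(vectors, axis=-1)
    active = norm > threshold  # SED decision from the vector length
    x, y, z = vectors[..., 0], vectors[..., 1], vectors[..., 2]
    azimuth_deg = np.rad2deg(np.arctan2(y, x))
    elevation_deg = np.rad2deg(np.arctan2(z, np.hypot(x, y)))
    return active, azimuth_deg, elevation_deg


# One frame, two classes: class 0 is active at azimuth 90, class 1 is silent.
activity = np.array([[1.0, 0.0]])
azimuth = np.array([[90.0, 30.0]])
elevation = np.array([[0.0, 10.0]])

target = encode_accdoa(activity, azimuth, elevation)
active, az_hat, el_hat = decode_accdoa(target)
```

Because the vector length carries the detection decision, a single regression output can serve both the SED and SEL tasks, which is the point of using one target for the single-stage system. The angles decoded for inactive classes carry no meaning and should be ignored.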

research
06/21/2021

Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection

This report describes our systems submitted to the DCASE2021 challenge t...
research
10/29/2020

ACCDOA: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization and Detection

Neural-network (NN)-based methods show high performance in sound event l...
research
10/13/2020

Sound event localization and detection based on CRNN using rectangular filters and channel rotation data augmentation

Sound Event Localization and Detection refers to the problem of identify...
research
09/06/2020

Overview and Evaluation of Sound Event Localization and Detection in DCASE 2019

Sound event localization and detection is a novel area of research that ...
research
10/25/2022

CoLoC: Conditioned Localizer and Classifier for Sound Event Localization and Detection

In this article, we describe Conditioned Localizer and Classifier (CoLoC...
research
03/19/2022

A Track-Wise Ensemble Event Independent Network for Polyphonic Sound Event Localization and Detection

Polyphonic sound event localization and detection (SELD) aims at detecti...
research
06/25/2020

Sound Event Localization and Detection using Squeeze-Excitation Residual CNNs

Sound Event Localization and Detection (SELD) is a problem related to th...
