A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection

06/13/2021
by Archontis Politis et al.

This report presents the dataset and baseline of Task 3 of the DCASE2021 Challenge on Sound Event Localization and Detection (SELD). The dataset emulates real recordings of static or moving sound events under real conditions of reverberation and ambient noise, using spatial room impulse responses captured in a variety of rooms, and is delivered in two spatial formats. The acoustical synthesis remains the same as in the previous iteration of the challenge; however, the new dataset brings more challenging conditions of polyphony and overlapping instances of the same class. The most important difference in the new dataset is the introduction of directional interferers: sound events that are localized in space but do not belong to the target classes to be detected and are not annotated. Since such interfering events are expected in every real-world SELD scenario, the new dataset aims to promote systems that deal with this condition effectively. A modified SELDnet baseline employing the recent ACCDOA representation of SELD problems accompanies the dataset and is shown to outperform the previous baseline. The new dataset is shown to be significantly more challenging for both baselines according to all considered metrics. To investigate the individual and combined effects of ambient noise, interferers, and reverberation, we study the performance of the baseline on different versions of the dataset that exclude or include combinations of these factors. The results indicate that directional interferers cause by far the most detrimental effects.
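
The abstract names the ACCDOA representation without describing it. As a rough illustration only, the sketch below follows the general idea of ACCDOA as published elsewhere: each class is assigned a Cartesian direction-of-arrival vector whose length encodes activity, so detection reduces to thresholding the vector norm. The function names, the 0.5 threshold, and the azimuth/elevation convention are assumptions made for this example and are not taken from the report or its baseline code.

```python
import math

def encode_accdoa(activity, azimuth_deg, elevation_deg):
    """Build an ACCDOA-style target for one class in one frame.

    activity is 0 or 1; the result is the unit DOA vector scaled by activity.
    (Hypothetical helper; angle convention is an assumption.)
    """
    az = math.radians(azimuth_deg)
    el = math.radians(elevation_deg)
    unit = (
        math.cos(el) * math.cos(az),
        math.cos(el) * math.sin(az),
        math.sin(el),
    )
    return tuple(activity * c for c in unit)

def decode_accdoa(vec, threshold=0.5):
    """Recover (active, azimuth_deg, elevation_deg) from a predicted vector.

    The class counts as active when the vector norm exceeds the threshold
    (0.5 is an assumed value, not one stated in the report).
    """
    x, y, z = vec
    norm = math.sqrt(x * x + y * y + z * z)
    if norm <= threshold:
        return False, None, None
    azimuth = math.degrees(math.atan2(y, x))
    # Clamp to guard against rounding pushing the ratio outside [-1, 1].
    elevation = math.degrees(math.asin(max(-1.0, min(1.0, z / norm))))
    return True, azimuth, elevation

if __name__ == "__main__":
    target = encode_accdoa(1, 90.0, 0.0)
    print(decode_accdoa(target))
    print(decode_accdoa((0.1, 0.2, 0.0)))
```

Coupling activity and direction in one vector lets a single regression output serve both detection and localization, which is the property the modified baseline relies on.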

research
06/02/2020

A Dataset of Reverberant Spatial Sound Scenes with Moving Sources for Sound Event Localization and Detection

This report presents the dataset and the evaluation setup of the Sound E...
research
01/24/2023

Perceptual evaluation of listener envelopment using spatial granular synthesis

Listener envelopment refers to the sensation of being surrounded by soun...
research
06/04/2022

STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events

This report presents the Sony-TAu Realistic Spatial Soundscapes 2022 (ST...
research
05/21/2019

A multi-room reverberant dataset for sound event localization and detection

This paper presents the sound event localization and detection (SELD) ta...
research
02/21/2022

L3DAS22 Challenge: Learning 3D Audio Sources in a Real Office Environment

The L3DAS22 Challenge is aimed at encouraging the development of machine...
research
05/29/2023

Multi-Band Acoustic Monitoring of Aerial Signatures

The Galileo Project's acoustic monitoring, omni-directional system (AMOS...
research
03/08/2022

Locate This, Not That: Class-Conditioned Sound Event DOA Estimation

Existing systems for sound event localization and detection (SELD) typic...
