Impact of Sound Duration and Inactive Frames on Sound Event Detection Performance

02/03/2021
by   Keisuke Imoto, et al.
0

In many methods of sound event detection (SED), a segmented time frame is regarded as one data sample to model training. The durations of sound events greatly depend on the sound event class, e.g., the sound event "fan" has a long duration, whereas the sound event "mouse clicking" is instantaneous. Thus, the difference in the duration between sound event classes results in a serious data imbalance in SED. Moreover, most sound events tend to occur occasionally; therefore, there are many more inactive time frames of sound events than active frames. This also causes a severe data imbalance between active and inactive frames. In this paper, we investigate the impact of sound duration and inactive frames on SED performance by introducing four loss functions, such as simple reweighting loss, inverse frequency loss, asymmetric focal loss, and focal batch Tversky loss. Then, we provide insights into how we tackle this imbalance problem.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/27/2020

Sound Event Detection Using Duration Robust Loss Function

Many methods of sound event detection (SED) based on machine learning re...
research
08/20/2018

A simple model for detection of rare sound events

We propose a simple recurrent model for detecting rare sound events, whe...
research
11/04/2020

Influence of Event Duration on Automatic Wheeze Classification

Patients with respiratory conditions typically exhibit adventitious resp...
research
02/10/2021

Sound Event Detection Based on Curriculum Learning Considering Learning Difficulty of Events

In conventional sound event detection (SED) models, two types of events,...
research
10/26/2020

Improving Sound Event Detection Metrics: Insights from DCASE 2020

The ranking of sound event detection (SED) systems may be biased by assu...
research
04/05/2022

RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection

Target sound detection (TSD) aims to detect the target sound from a mixt...
research
03/26/2020

Incremental Learning Algorithm for Sound Event Detection

This paper presents a new learning strategy for the Sound Event Detectio...

Please sign up or login with your details

Forgot password? Click here to reset