Sound Event Detection Using Duration Robust Loss Function

06/27/2020
by   Daichi Akiyama, et al.
0

Many methods of sound event detection (SED) based on machine learning regard a segmented time frame as one data sample to model training. However, the sound durations of sound events vary greatly depending on the sound event class, e.g., the sound event “fan” has a long time duration, while the sound event “mouse clicking” is instantaneous. The difference in the time duration between sound event classes thus causes a serious data imbalance problem in SED. In this paper, we propose a method for SED using a duration robust loss function, which can focus model training on sound events of short duration. In the proposed method, we focus on a relationship between the duration of the sound event and the ease/difficulty of model training. In particular, many sound events of long duration (e.g., sound event “fan”) are stationary sounds, which have less variation in their acoustic features and their model training is easy. Meanwhile, some sound events of short duration (e.g., sound event “object impact”) have more than one audio pattern, such as attack, decay, and release parts. We thus apply a class-wise reweighting to the binary-cross entropy loss function depending on the ease/difficulty of model training. Evaluation experiments conducted using TUT Sound Events 2016/2017 and TUT Acoustic Scenes 2016 datasets show that the proposed method respectively improves the detection performance of sound events by 3.15 and 4.37 percentage points in macro- and micro-Fscores compared with a conventional method using the binary-cross entropy loss function.

READ FULL TEXT
research
02/03/2021

Impact of Sound Duration and Inactive Frames on Sound Event Detection Performance

In many methods of sound event detection (SED), a segmented time frame i...
research
07/22/2021

Improving Polyphonic Sound Event Detection on Multichannel Recordings with the Sørensen-Dice Coefficient Loss and Transfer Learning

The Sørensen–Dice Coefficient has recently seen rising popularity as a l...
research
02/10/2021

Sound Event Detection Based on Curriculum Learning Considering Learning Difficulty of Events

In conventional sound event detection (SED) models, two types of events,...
research
11/04/2020

Influence of Event Duration on Automatic Wheeze Classification

Patients with respiratory conditions typically exhibit adventitious resp...
research
12/08/2020

Split: Inferring Unobserved Event Probabilities for Disentangling Brand-Customer Interactions

Often, data contains only composite events composed of multiple events, ...
research
11/30/2017

Direct Segmented Sonification of Characteristic Features of the Data Domain

Sonification and audification create auditory displays of datasets. Audi...
research
04/05/2019

Modelling of Sound Events with Hidden Imbalances Based on Clustering and Separate Sub-Dictionary Learning

This paper proposes an effective modelling of sound event spectra with a...

Please sign up or login with your details

Forgot password? Click here to reset