Duration robust sound event detection

04/08/2019
by   Heinrich Dinkel, et al.
0

Task 4 of the Dcase2018 challenge demonstrated that substantially more research is needed for a real-world application of sound event detection. Analyzing the challenge results it can be seen that most successful models are biased towards predicting long (e.g., over 5s) utterances. This work aims to investigate the performance impact of fixed sized window median filter post-processing and advocate the use of double thresholding as a more robust and predictable post-processing method. Further, four different temporal subsampling methods within the CRNN framework are proposed: mean-max, alpha-mean-max, Lp-norm and convolutional. We show that for this task subsampling the temporal resolution by a neural network enhances the F1 score as well as onset and offset accuracies. Our best single model achieves 30.1 on the evaluation set and the best fusion model 32.5 previously best attempt by 0.1

READ FULL TEXT
research
06/17/2019

Evaluation of post-processing algorithms for polyphonic sound event detection

Sound event detection (SED) aims at identifying audio events (audio tagg...
research
06/27/2023

Post-Processing Independent Evaluation of Sound Event Detection Systems

Due to the high variation in the application requirements of sound event...
research
08/19/2022

Improving Post-Processing of Audio Event Detectors Using Reinforcement Learning

We apply post-processing to the class probability distribution outputs o...
research
01/19/2021

Towards duration robust weakly supervised sound event detection

Sound event detection (SED) is the task of tagging the absence or presen...
research
09/11/2019

Guided Learning Convolution System for DCASE 2019 Task 4

In this paper, we describe in detail the system we submitted to DCASE201...
research
10/18/2019

A Framework for the Robust Evaluation of Sound Event Detection

This work defines a new framework for performance evaluation of polyphon...
research
01/23/2023

Optimising complexity of CNN models for resource constrained devices: QRS detection case study

Traditional DL models are complex and resource hungry and thus, care nee...

Please sign up or login with your details

Forgot password? Click here to reset