SEED: Sound Event Early Detection via Evidential Uncertainty

02/05/2022
by   Xujiang Zhao, et al.
9

Sound Event Early Detection (SEED) is an essential task in recognizing the acoustic environments and soundscapes. However, most of the existing methods focus on the offline sound event detection, which suffers from the over-confidence issue of early-stage event detection and usually yield unreliable results. To solve the problem, we propose a novel Polyphonic Evidential Neural Network (PENet) to model the evidential uncertainty of the class probability with Beta distribution. Specifically, we use a Beta distribution to model the distribution of class probabilities, and the evidential uncertainty enriches uncertainty representation with evidence information, which plays a central role in reliable prediction. To further improve the event detection performance, we design the backtrack inference method that utilizes both the forward and backward audio features of an ongoing event. Experiments on the DESED database show that the proposed method can simultaneously improve 13.0% and 3.8% in time delay and detection F1 score compared to the state-of-the-art methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/26/2019

A two-step system for sound event localization and detection

Sound event detection and sound event localization requires different fe...
research
12/06/2017

Enabling Early Audio Event Detection with Neural Networks

This paper presents a methodology for early detection of audio events fr...
research
04/25/2020

Sound Event Detection Utilizing Graph Laplacian Regularization with Event Co-occurrence

A limited number of types of sound event occur in an acoustic scene and ...
research
07/24/2018

A Simple Probabilistic Model for Uncertainty Estimation

The article focuses on determining the predictive uncertainty of a model...
research
08/10/2017

DNN and CNN with Weighted and Multi-task Loss Functions for Audio Event Detection

This report presents our audio event detection system submitted for Task...
research
10/20/2020

Power pooling: An adaptive pooling function for weakly labelled sound event detection

Access to large corpora with strongly labelled sound events is expensive...
research
03/06/2019

SNU_IDS at SemEval-2019 Task 3: Addressing Training-Test Class Distribution Mismatch in Conversational Classification

We present several techniques to tackle the mismatch in class distributi...

Please sign up or login with your details

Forgot password? Click here to reset