Selective Pseudo-labeling and Class-wise Discriminative Fusion for Sound Event Detection

03/04/2022
by   Yunhao Liang, et al.
0

In recent years, exploring effective sound separation (SSep) techniques to improve overlapping sound event detection (SED) attracts more and more attention. Creating accurate separation signals to avoid the catastrophic error accumulation during SED model training is very important and challenging. In this study, we first propose a novel selective pseudo-labeling approach, termed SPL, to produce high confidence separated target events from blind sound separation outputs. These target events are then used to fine-tune the original SED model that pre-trained on the sound mixtures in a multi-objective learning style. Then, to further leverage the SSep outputs, a class-wise discriminative fusion is proposed to improve the final SED performances, by combining multiple frame-level event predictions of both sound mixtures and their separated signals. All experiments are performed on the public DCASE 2021 Task 4 dataset, and results show that our approaches significantly outperforms the official baseline, the collar-based F 1, PSDS1 and PSDS2 performances are improved from 44.3

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/08/2020

Improving Sound Event Detection In Domestic Environments Using Sound Separation

Performing sound event detection on real-world recordings often implies ...
research
07/11/2019

Polyphonic Sound Event and Sound Activity Detection: A Multi-task approach

Polyphonic Sound Event Detection (SED) in real-world recordings is a cha...
research
04/13/2022

Sound Event Triage: Detecting Sound Events Considering Priority of Classes

We propose a new task for sound event detection (SED): sound event triag...
research
11/18/2022

Self-Remixing: Unsupervised Speech Separation via Separation and Remixing

We present Self-Remixing, a novel self-supervised speech separation meth...
research
04/23/2022

Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation

Recently, supervised speech separation has made great progress. However,...
research
03/23/2021

GISE-51: A scalable isolated sound events dataset

Most of the existing isolated sound event datasets comprise a small numb...
research
08/30/2019

Recursive Visual Sound Separation Using Minus-Plus Net

Sounds provide rich semantics, complementary to visual data, for many ta...

Please sign up or login with your details

Forgot password? Click here to reset