Disentangled Feature for Weakly Supervised Multi-class Sound Event Detection

05/24/2019
by   Liwei Lin, et al.
0

We propose a disentangled feature for weakly supervised multiclass sound event detection (SED), which helps ameliorate the performance and the training efficiency of class-wise attention based detection system by the introduction of more class-wise prior information as well as the network redundancy weight reduction. In this paper, we approach SED as a multiple instance learning (MIL) problem and utilize a neural network framework with class-wise attention pooling (cATP) module to solve it. Aiming at making finer detection even if there is only a small number of clips with less co-occurrence of the categories available in the training set, we optimize the high-level feature space of cATP-MIL by disentangling it based on class-wise identifiable information in the training set and obtain multiple different subspaces. Experiments show that our approach achieves competitive performance on Task4 of the DCASE2018 challenge.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/24/2019

Specialized Decision Surface and Disentangled Feature for Weakly-Supervised Polyphonic Sound Event Detection

Sound event detection (SED) is to recognize the presence of sound events...
research
03/28/2019

Hierarchical Pooling Structure for Weakly Labeled Sound Event Detection

Sound event detection with weakly labeled data is considered as a proble...
research
07/21/2020

Guided multi-branch learning systems for DCASE 2020 Task 4

In this paper, we describe in detail our systems for DCASE 2020 Task 4. ...
research
03/10/2023

Improving Weakly Supervised Sound Event Detection with Causal Intervention

Existing weakly supervised sound event detection (WSSED) work has not ex...
research
09/11/2019

Guided Learning Convolution System for DCASE 2019 Task 4

In this paper, we describe in detail the system we submitted to DCASE201...
research
08/07/2020

A Joint Framework for Audio Tagging and Weakly Supervised Acoustic Event Detection Using DenseNet with Global Average Pooling

This paper proposes a network architecture mainly designed for audio tag...
research
04/06/2022

A Weakly Supervised Propagation Model for Rumor Verification and Stance Detection with Multiple Instance Learning

The diffusion of rumors on microblogs generally follows a propagation tr...

Please sign up or login with your details

Forgot password? Click here to reset