Sound event detection using weakly-labeled semi-supervised data with GCRNNS, VAT and Self-Adaptive Label Refinement

10/16/2018
by   Robert Harb, et al.
0

In this paper, we present a gated convolutional recurrent neural network based approach to solve task 4, large-scale weakly labelled semi-supervised sound event detection in domestic environments, of the DCASE 2018 challenge. Gated linear units and a temporal attention layer are used to predict the onset and offset of sound events in 10s long audio clips. Whereby for training only weakly-labelled data is used. Virtual adversarial training is used for regularization, utilizing both labelled and unlabeled data. Furthermore, we introduce self-adaptive label refinement, a method which allows unsupervised adaption of our trained system to refine the accuracy of frame-level class predictions. The proposed system reaches an overall macro averaged event-based F-score of 34.6 baseline system.

READ FULL TEXT

page 1

page 2

page 4

research
10/01/2017

Large-scale weakly supervised audio classification using gated convolutional neural network

In this paper, we present a gated convolutional neural network and a tem...
research
11/01/2018

Weakly supervised CRNN system for sound event detection with large-scale unlabeled in-domain data

Sound event detection (SED) is typically posed as a supervised learning ...
research
04/27/2019

Sound Event Detection with Sequentially Labelled Data Based on Connectionist Temporal Classification and Unsupervised Clustering

Sound event detection (SED) methods typically rely on either strongly la...
research
07/02/2020

Semi-Supervised NMF-CNN For Sound Event Detection

For the DCASE 2020 Challenge Task 4, this paper pro-posed a combinative ...
research
07/27/2018

Large-Scale Weakly Labeled Semi-Supervised Sound Event Detection in Domestic Environments

This paper presents DCASE 2018 task 4. The task evaluates systems for th...

Please sign up or login with your details

Forgot password? Click here to reset