Weakly Labeled Sound Event Detection Using Tri-training and Adversarial Learning

10/14/2019
by   Hyoungwoo Park, et al.

This paper presents a semi-supervised learning framework for weakly labeled polyphonic sound event detection, developed for Task 4 of the DCASE 2019 challenge, that combines tri-training and adversarial learning. The goal of Task 4 is to detect the onsets and offsets of multiple sound events in a single audio clip. The dataset consists of synthetic data with strong labels (sound event labels with boundaries) and real data that is either weakly labeled (sound event labels only) or unlabeled. Given this dataset, we apply tri-training, in which two different classifiers are used to obtain pseudo labels for the weakly labeled and unlabeled data, and the final classifier is trained on the strongly labeled data together with the weakly labeled and unlabeled data and their pseudo labels. We also apply adversarial learning to reduce the domain gap between the real and synthetic data. We evaluated the framework on the validation set of the Task 4 dataset, where it shows a considerable performance improvement over the baseline model.
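
The pseudo-labeling step of tri-training can be illustrated with a minimal sketch. It assumes, purely for illustration, that a clip receives a pseudo label only when two classifiers agree on every class after thresholding and at least one event is active. The abstract does not specify the selection rule, so the agreement criterion, the 0.5 threshold, and the names `binarize` and `select_pseudo_labels` are hypothetical.

```python
from typing import List, Tuple


def binarize(probs: List[float], threshold: float) -> List[int]:
    """Turn per-class probabilities into a multi-hot label vector."""
    return [1 if p >= threshold else 0 for p in probs]


def select_pseudo_labels(
    probs_a: List[List[float]],
    probs_b: List[List[float]],
    threshold: float = 0.5,  # assumed value, not taken from the paper
) -> List[Tuple[int, List[int]]]:
    """Return (clip index, pseudo label) for clips where both classifiers agree.

    probs_a and probs_b hold clip-level class probabilities from two
    different classifiers, one row per clip.
    """
    selected = []
    for idx, (pa, pb) in enumerate(zip(probs_a, probs_b)):
        label_a = binarize(pa, threshold)
        label_b = binarize(pb, threshold)
        # Assumed rule: keep the clip only if the two label vectors match
        # exactly and at least one event class is predicted as active.
        if label_a == label_b and any(label_a):
            selected.append((idx, label_a))
    return selected


# Toy predictions for three clips and three event classes.
probs_a = [[0.9, 0.1, 0.2], [0.6, 0.7, 0.1], [0.2, 0.3, 0.1]]
probs_b = [[0.8, 0.2, 0.1], [0.4, 0.8, 0.2], [0.1, 0.2, 0.3]]
pseudo = select_pseudo_labels(probs_a, probs_b)
print(pseudo)
```

In this toy example only the first clip is kept: the classifiers disagree on the second clip, and neither predicts any active event for the third. The selected clips and their pseudo labels would then be added to the training set of the final classifier alongside the strongly labeled data.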
