Cross-Referencing Self-Training Network for Sound Event Detection in Audio Mixtures

05/27/2021
by   Sangwook Park, et al.
0

Sound event detection is an important facet of audio tagging that aims to identify sounds of interest and define both the sound category and time boundaries for each sound event in a continuous recording. With advances in deep neural networks, there has been tremendous improvement in the performance of sound event detection systems, although at the expense of costly data collection and labeling efforts. In fact, current state-of-the-art methods employ supervised training methods that leverage large amounts of data samples and corresponding labels in order to facilitate identification of sound category and time stamps of events. As an alternative, the current study proposes a semi-supervised method for generating pseudo-labels from unsupervised data using a student-teacher scheme that balances self-training and cross-training. Additionally, this paper explores post-processing which extracts sound intervals from network prediction, for further improvement in sound event detection performance. The proposed approach is evaluated on sound event detection task for the DCASE2020 challenge. The results of these methods on both "validation" and "public evaluation" sets of DESED database show significant improvement compared to the state-of-the art systems in semi-supervised learning.

READ FULL TEXT

page 1

page 13

research
01/30/2021

Semi-supervised Sound Event Detection using Random Augmentation and Consistency Regularization

Sound event detection is a core module for acoustic environmental analys...
research
11/09/2018

Joint Acoustic and Class Inference for Weakly Supervised Sound Event Detection

Sound event detection is a challenging task, especially for scenes with ...
research
10/14/2019

Weakly Labeled Sound Event Detection Using Tri-training and Adversarial Learning

This paper considers a semi-supervised learning framework for weakly lab...
research
07/17/2019

HODGEPODGE: Sound event detection based on ensemble of semi-supervised learning methods

In this paper, we present a method called HODGEPODGE[1] for large-scale ...
research
05/02/2020

Addressing Missing Labels in Large-scale Sound Event Recognition using a Teacher-student Framework with Loss Masking

The study of label noise in sound event recognition has recently gained ...
research
10/21/2021

RCT: Random Consistency Training for Semi-supervised Sound Event Detection

Sound event detection (SED), as a core module of acoustic environmental ...
research
06/21/2022

A Multi-grained based Attention Network for Semi-supervised Sound Event Detection

Sound event detection (SED) is an interesting but challenging task due t...

Please sign up or login with your details

Forgot password? Click here to reset