Semi-supervised Sound Event Detection with Local and Global Consistency Regularization

09/15/2023
by   Yiming Li, et al.
0

Learning meaningful frame-wise features on a partially labeled dataset is crucial to semi-supervised sound event detection. Prior works either maintain consistency on frame-level predictions or seek feature-level similarity among neighboring frames, which cannot exploit the potential of unlabeled data. In this work, we design a Local and Global Consistency (LGC) regularization scheme to enhance the model on both label- and feature-level. The audio CutMix is introduced to change the contextual information of clips. Then, the local consistency is adopted to encourage the model to leverage local features for frame-level predictions, and the global consistency is applied to force features to align with global prototypes through a specially designed contrastive loss. Experiments on the DESED dataset indicate the superiority of LGC, surpassing its respective competitors largely with the same settings as the baseline system. Besides, combining LGC with existing methods can obtain further improvements. The code will be released soon.

READ FULL TEXT
research
01/30/2021

Semi-supervised Sound Event Detection using Random Augmentation and Consistency Regularization

Sound event detection is a core module for acoustic environmental analys...
research
03/06/2019

Semi-Supervised Few-Shot Learning with Local and Global Consistency

Learning from a few examples is a key characteristic of human intelligen...
research
10/18/2022

A Hybrid System of Sound Event Detection Transformer and Frame-wise Model for DCASE 2022 Task 4

In this paper, we describe in detail our system for DCASE 2022 Task4. Th...
research
04/22/2021

Framing Unpacked: A Semi-Supervised Interpretable Multi-View Model of Media Frames

Understanding how news media frame political issues is important due to ...
research
07/17/2019

HODGEPODGE: Sound event detection based on ensemble of semi-supervised learning methods

In this paper, we present a method called HODGEPODGE[1] for large-scale ...
research
10/21/2021

RCT: Random Consistency Training for Semi-supervised Sound Event Detection

Sound event detection (SED), as a core module of acoustic environmental ...
research
03/04/2021

Semi-supervised Left Atrium Segmentation with Mutual Consistency Training

Semi-supervised learning has attracted great attention in the field of m...

Please sign up or login with your details

Forgot password? Click here to reset