Hodge and Podge: Hybrid Supervised Sound Event Detection with Multi-Hot MixMatch and Composition Consistence Training

02/13/2020
by   Ziqiang Shi, et al.

In this paper, we propose a method called Hodge and Podge for sound event detection and demonstrate it on the dataset of the Detection and Classification of Acoustic Scenes and Events (DCASE) 2019 Challenge Task 4. This task aims to predict the presence or absence of sound events in home environments, together with their onset and offset times. Sound event detection is challenging because large-scale, real, strongly labeled data is scarce. Recently, deep semi-supervised learning (SSL) has proven effective for modeling with weakly labeled and unlabeled data. This work explores how to extend deep SSL into a new, state-of-the-art sound event detection method. With a convolutional recurrent neural network (CRNN) as the backbone, we first introduce a multi-scale squeeze-excitation mechanism, yielding a pyramid squeeze-excitation CRNN. The pyramid squeeze-excitation layer accounts for the fact that different sound events have different durations and adaptively recalibrates channel-wise spectrogram responses. Further, to compensate for the lack of real strongly labeled data, we propose multi-hot MixMatch and composition consistency training with temporal-frequency augmentation. Our experiments on the public DCASE 2019 Challenge Task 4 validation data yield an event-based F-score of 43.4%, about 1.6 percentage points (absolute) better than the state-of-the-art methods in the challenge, whereas the official baseline achieves an F-score of 25.8%.
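The abstract names multi-hot MixMatch without spelling out its steps. As a rough illustration only, the sketch below shows one plausible way the two core MixMatch operations (label sharpening and mixup) could be adapted to multi-hot targets, where several events may be active at once. The function names, the per-class Bernoulli form of the sharpening, and the default values of `temperature` and `alpha` are assumptions made for this example, not details taken from the paper.

```python
import random


def sharpen_multi_hot(probs, temperature=0.5):
    """Sharpen each class probability independently (assumed multi-hot variant).

    Each class is treated as its own Bernoulli distribution, so the
    probabilities are not forced to sum to one across classes.
    """
    sharpened = []
    for p in probs:
        pos = p ** (1.0 / temperature)
        neg = (1.0 - p) ** (1.0 / temperature)
        sharpened.append(pos / (pos + neg))
    return sharpened


def mixup(x1, y1, x2, y2, alpha=0.75, rng=random):
    """Convexly combine two examples and their multi-hot targets.

    As in MixMatch, the mixing weight is clamped to at least 0.5 so the
    result stays closer to the first example.
    """
    lam = rng.betavariate(alpha, alpha)
    lam = max(lam, 1.0 - lam)
    x = [lam * a + (1.0 - lam) * b for a, b in zip(x1, x2)]
    y = [lam * a + (1.0 - lam) * b for a, b in zip(y1, y2)]
    return x, y, lam


if __name__ == "__main__":
    # Guessed clip-level probabilities for three event classes.
    guess = [0.8, 0.5, 0.1]
    print(sharpen_multi_hot(guess))

    # Mix a toy two-bin feature vector and its multi-hot label with another one.
    x, y, lam = mixup([1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0])
    print(x, y, lam)
```

With a temperature below 1, confident guesses move toward 0 or 1 while a guess of exactly 0.5 stays unchanged, which is the intended effect of sharpening on pseudo-labels for unlabeled clips.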

research
07/17/2019

HODGEPODGE: Sound event detection based on ensemble of semi-supervised learning methods

In this paper, we present a method called HODGEPODGE[1] for large-scale ...
research
10/07/2021

Peer Collaborative Learning for Polyphonic Sound Event Detection

This paper describes that semi-supervised learning called peer collabora...
research
06/21/2022

A Multi-grained based Attention Network for Semi-supervised Sound Event Detection

Sound event detection (SED) is an interesting but challenging task due t...
research
11/01/2018

Weakly supervised CRNN system for sound event detection with large-scale unlabeled in-domain data

Sound event detection (SED) is typically posed as a supervised learning ...
research
07/08/2021

Heavily Augmented Sound Event Detection utilizing Weak Predictions

The performances of Sound Event Detection (SED) systems are greatly limi...
research
10/25/2019

Meta-Learning with Dynamic-Memory-Based Prototypical Network for Few-Shot Event Detection

Event detection (ED), a sub-task of event extraction, involves identifyi...
research
09/21/2020

Detecting Acoustic Events Using Convolutional Macaron Net

In this paper, we propose to address the issue of the lack of strongly l...
