A Multi-grained based Attention Network for Semi-supervised Sound Event Detection

06/21/2022
by   Ying Hu, et al.
0

Sound event detection (SED) is an interesting but challenging task due to the scarcity of data and diverse sound events in real life. This paper presents a multi-grained based attention network (MGA-Net) for semi-supervised sound event detection. To obtain the feature representations related to sound events, a residual hybrid convolution (RH-Conv) block is designed to boost the vanilla convolution's ability to extract the time-frequency features. Moreover, a multi-grained attention (MGA) module is designed to learn temporal resolution features from coarse-level to fine-level. With the MGA module,the network could capture the characteristics of target events with short- or long-duration, resulting in more accurately determining the onset and offset of sound events. Furthermore, to effectively boost the performance of the Mean Teacher (MT) method, a spatial shift (SS) module as a data perturbation mechanism is introduced to increase the diversity of data. Experimental results show that the MGA-Net outperforms the published state-of-the-art competitors, achieving 53.27 polyphonic sound detection score (PSDS) on the validation and public set respectively.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/29/2019

Multi-Scale Time-Frequency Attention for Rare Sound Event Detection

Attention mechanism has been widely applied to various sound-related tas...
research
02/13/2020

Hodge and Podge: Hybrid Supervised Sound Event Detection with Multi-Hot MixMatch and Composition Consistence Training

In this paper, we propose a method called Hodge and Podge for sound even...
research
02/18/2023

Multi-dimensional frequency dynamic convolution with confident mean teacher for sound event detection

Recently, convolutional neural networks (CNNs) have been widely used in ...
research
02/03/2022

A benchmark of state-of-the-art sound event detection systems evaluated on synthetic soundscapes

This paper proposes a benchmark of submissions to Detection and Classifi...
research
05/27/2021

Cross-Referencing Self-Training Network for Sound Event Detection in Audio Mixtures

Sound event detection is an important facet of audio tagging that aims t...
research
06/13/2021

SoundDet: Polyphonic Sound Event Detection and Localization from Raw Waveform

We present a new framework SoundDet, which is an end-to-end trainable an...

Please sign up or login with your details

Forgot password? Click here to reset