A Capsule based Approach for Polyphonic Sound Event Detection

07/19/2018
by   Yaming Liu, et al.
0

Polyphonic sound event detection (polyphonic SED) is an interesting but challenging task due to the concurrence of multiple sound events. Recently, SED methods based on convolutional neural networks (CNN) and recurrent neural networks (RNN) have shown promising performance. Generally, CNN are designed for local feature extraction while RNN are used to model the temporal dependency among these local features. Despite their success, it is still insufficient for existing deep learning techniques to separate individual sound event from their mixture, largely due to the overlapping characteristic of features. Motivated by the success of Capsule Networks (CapsNet), we propose a more suitable capsule based approach for polyphonic SED. Specifically, several capsule layers are designed to effectively select representative frequency bands for each individual sound event. The temporal dependency of capsule's outputs is then modeled by a RNN. And a dynamic threshold method is proposed for making the final decision based on RNN outputs. Experiments on the TUT-SED Synthetic 2016 dataset show that the proposed approach obtains an F1-score of 68.8 method of 66.4

READ FULL TEXT
research
10/15/2018

Polyphonic Sound Event Detection by using Capsule Neural Networks

Artificial sound event detection (SED) has the aim to mimic the human ab...
research
10/15/2018

Polyphonic Sound Event Detection by using Capsule Neural Network

Artificial sound event detection (SED) has the aim to mimic the human ab...
research
07/19/2019

Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling

A sound event detection (SED) method typically takes as an input a seque...
research
11/25/2021

Polyphonic Sound Event Detection Using Capsule Neural Network on Multi-Type-Multi-Scale Time-Frequency Representation

The challenges of polyphonic sound event detection (PSED) stem from the ...
research
07/09/2021

Multi-path Convolutional Neural Networks Efficiently Improve Feature Extraction in Continuous Adventitious Lung Sound Detection

We previously established a large lung sound database, HF_Lung_V2 (Lung_...
research
08/13/2020

MIXCAPS: A Capsule Network-based Mixture of Experts for Lung Nodule Malignancy Prediction

Lung diseases including infections such as Pneumonia, Tuberculosis, and ...
research
04/04/2016

Recurrent Neural Networks for Polyphonic Sound Event Detection in Real Life Recordings

In this paper we present an approach to polyphonic sound event detection...

Please sign up or login with your details

Forgot password? Click here to reset