Masked Conditional Neural Networks for Automatic Sound Events Recognition

02/15/2018
by Fady Medhat, et al.

Deep neural network architectures designed for application domains other than sound, especially image recognition, may not optimally harness the time-frequency representation when adapted to the sound recognition problem. In this work, we explore the ConditionaL Neural Network (CLNN) and the Masked ConditionaL Neural Network (MCLNN) for multi-dimensional temporal signal recognition. The CLNN considers the inter-frame relationship, and the MCLNN enforces a systematic sparseness over the network's links so that it learns in frequency bands rather than bins, which makes the network frequency-shift invariant in a way that mimics a filterbank. The mask also allows several combinations of features to be considered concurrently, a selection that is usually handcrafted through exhaustive manual search. We applied the MCLNN to the environmental sound recognition problem using the ESC-10 and ESC-50 datasets. The MCLNN achieved competitive performance, using 12% of the parameters and without augmentation, compared to state-of-the-art Convolutional Neural Networks.
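The two ideas in the abstract, conditioning each output on a window of neighbouring frames and masking the weights so that each hidden unit sees only a band of frequency bins, can be illustrated with a small NumPy sketch. It is an illustration under stated assumptions, not the authors' implementation: the function names `band_mask` and `masked_conditional_layer`, the `bandwidth` and `stride` parameters, the wrap-around at the top bin, and the ReLU activation are all hypothetical choices made for this example.

```python
import numpy as np


def band_mask(n_features, n_hidden, bandwidth, stride):
    """Binary (n_features, n_hidden) mask (assumed layout, not the paper's).

    Hidden unit j is connected to `bandwidth` consecutive frequency bins,
    and the band start moves by `stride` bins from one unit to the next.
    """
    mask = np.zeros((n_features, n_hidden), dtype=np.float32)
    for j in range(n_hidden):
        start = (j * stride) % n_features
        # Wrap around when a band runs past the last bin (an assumption).
        rows = (start + np.arange(bandwidth)) % n_features
        mask[rows, j] = 1.0
    return mask


def masked_conditional_layer(frames, weights, bias, mask):
    """Apply a masked, window-conditioned dense layer to a spectrogram.

    frames:  (T, F) array, one row per time frame.
    weights: (C, F, H) array, one (F, H) matrix per frame in the window,
             where C = 2n + 1 is the window length.
    bias:    (H,) array.
    mask:    (F, H) binary array shared by every frame in the window.
    Returns a (T - C + 1, H) array of activations.
    """
    context = weights.shape[0]
    n_out = frames.shape[0] - context + 1
    masked = weights * mask  # broadcasts the mask over the window axis
    out = np.zeros((n_out, weights.shape[2]), dtype=np.float32)
    for t in range(n_out):
        window = frames[t:t + context]  # (C, F) neighbouring frames
        # Sum the contribution of every frame in the window.
        out[t] = np.einsum("uf,ufh->h", window, masked) + bias
    return np.maximum(out, 0.0)  # ReLU, chosen only for illustration


rng = np.random.default_rng(0)
spectrogram = rng.standard_normal((10, 6)).astype(np.float32)
weights = rng.standard_normal((3, 6, 4)).astype(np.float32)
mask = band_mask(n_features=6, n_hidden=4, bandwidth=2, stride=1)
hidden = masked_conditional_layer(spectrogram, weights, np.zeros(4, np.float32), mask)
print(hidden.shape)  # (8, 4)
```

Because the mask zeroes every weight outside a unit's band, each hidden unit responds to a localized frequency region, much like one channel of a filterbank, while the window axis supplies the inter-frame context.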


