Automatic Classification of Music Genre using Masked Conditional Neural Networks

01/16/2018
by Fady Medhat, et al.

Neural-network architectures used for sound recognition are usually adapted from other application domains, such as image recognition, and may not exploit the time-frequency representation of a signal. The ConditionaL Neural Network (CLNN) and its extension, the Masked ConditionaL Neural Network (MCLNN), are designed for multidimensional temporal signal recognition. The CLNN is trained over a window of frames to preserve the inter-frame relation, and the MCLNN enforces a systematic sparseness over the network's links that mimics filterbank-like behavior. The masking operation induces the network to learn in frequency bands, which decreases the network's susceptibility to frequency shifts in time-frequency representations. Additionally, the mask allows a range of feature combinations to be explored concurrently, analogous to manually handcrafting the optimum collection of features for a recognition task. The MCLNN achieved competitive performance on the Ballroom music dataset compared to several hand-crafted attempts and outperformed models based on state-of-the-art Convolutional Neural Networks.
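The masking idea described above can be illustrated with a minimal sketch. It is not the authors' exact construction: the function names `band_mask` and `masked_window_activation`, the `bandwidth` and `overlap` parameters, and the wrap-around rule for placing bands are all assumptions made for illustration. The sketch builds a binary mask that restricts each hidden unit to a contiguous band of frequency bins, then applies it element-wise to one weight matrix per frame in a window.

```python
def band_mask(n_features, n_hidden, bandwidth, overlap):
    """Binary mask: mask[f][h] is 1 when hidden unit h may see feature f.

    Hypothetical band layout: consecutive hidden units look at bands of
    `bandwidth` bins that share `overlap` bins with the previous band.
    """
    step = bandwidth - overlap  # shift between consecutive bands
    if step <= 0:
        raise ValueError("overlap must be smaller than bandwidth")
    mask = [[0] * n_hidden for _ in range(n_features)]
    for h in range(n_hidden):
        start = (h * step) % n_features  # wrap so every unit gets a band
        for f in range(start, min(start + bandwidth, n_features)):
            mask[f][h] = 1
    return mask


def masked_window_activation(frames, weights, mask):
    """Pre-activation of each hidden unit for one window of frames.

    frames:  list of d frames, each a list of n_features values
    weights: list of d matrices (one per window position),
             each n_features x n_hidden
    The mask zeroes every link that falls outside a unit's band.
    """
    n_features, n_hidden = len(mask), len(mask[0])
    out = [0.0] * n_hidden
    for frame, w in zip(frames, weights):
        for f in range(n_features):
            for h in range(n_hidden):
                out[h] += frame[f] * w[f][h] * mask[f][h]
    return out
```

With `band_mask(6, 3, 3, 1)`, for instance, the first hidden unit is connected to bins 0 to 2, the second to bins 2 to 4, and the third to bins 4 and 5, so neighbouring units share one bin. Because every unit only sees a local band, a small shift of energy along the frequency axis tends to stay within the same band.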

research
04/08/2018

Environmental Sound Recognition using Masked Conditional Neural Networks

Neural network based architectures used for sound recognition are usuall...
research
02/18/2018

Music Genre Classification using Masked Conditional Neural Networks

The ConditionaL Neural Networks (CLNN) and the Masked ConditionaL Neural...
research
05/25/2018

Masked Conditional Neural Networks for Environmental Sound Classification

The ConditionaL Neural Network (CLNN) exploits the nature of the tempora...
research
02/15/2018

Masked Conditional Neural Networks for Automatic Sound Events Recognition

Deep neural network architectures designed for application domains other...
research
03/06/2018

Masked Conditional Neural Networks for Audio Classification

We present the ConditionaL Neural Network (CLNN) and the Masked Conditio...
research
06/20/2019

Adversarial Learning for Improved Onsets and Frames Music Transcription

Automatic music transcription is considered to be one of the hardest pro...
research
07/25/2023

Fitting Auditory Filterbanks with Multiresolution Neural Networks

Waveform-based deep learning faces a dilemma between nonparametric and p...
