Audio-Based Music Classification with DenseNet And Data Augmentation

06/15/2019
by   Wenhao Bian, et al.
0

In recent years, deep learning technique has received intense attention owing to its great success in image recognition. A tendency of adaption of deep learning in various information processing fields has formed, including music information retrieval (MIR). In this paper, we conduct a comprehensive study on music audio classification with improved convolutional neural networks (CNNs). To the best of our knowledge, this the first work to apply Densely Connected Convolutional Networks (DenseNet) to music audio tagging, which has been demonstrated to perform better than Residual neural network (ResNet). Additionally, two specific data augmentation approaches of time overlapping and pitch shifting have been proposed to address the deficiency of labelled data in the MIR. Moreover, an ensemble learning of stacking is employed based on SVM. We believe that the proposed combination of strong representation of DenseNet and data augmentation can be adapted to other audio processing tasks.

READ FULL TEXT
research
12/11/2019

Audiogmenter: a MATLAB Toolbox for Audio Data Augmentation

Audio data augmentation is a key step in training deep neural networks f...
research
10/23/2019

Graph Representation learning for Audio Music genre Classification

Music genre is arguably one of the most important and discriminative inf...
research
01/15/2020

Deep Learning for MIR Tutorial

Deep Learning has become state of the art in visual computing and contin...
research
10/11/2017

Audio Concept Classification with Hierarchical Deep Neural Networks

Audio-based multimedia retrieval tasks may identify semantic information...
research
08/23/2021

Learning Sparse Analytic Filters for Piano Transcription

In recent years, filterbank learning has become an increasingly popular ...
research
06/19/2017

Kapre: On-GPU Audio Preprocessing Layers for a Quick Implementation of Deep Neural Network Models with Keras

We introduce Kapre, Keras layers for audio and music signal preprocessin...

Please sign up or login with your details

Forgot password? Click here to reset