Receptive-Field Regularized CNNs for Music Classification and Tagging

07/27/2020
by Khaled Koutini, et al.

Convolutional Neural Networks (CNNs) have been successfully used in various Music Information Retrieval (MIR) tasks, both as end-to-end models and as feature extractors for more complex systems. However, the MIR field is still dominated by variants of the classical VGG-based CNN architecture, often in combination with more complex modules such as attention and/or techniques such as pre-training on large datasets. Deeper models such as ResNet, which surpassed VGG by a large margin in other domains, are rarely used in MIR. One of the main reasons for this, as we will show, is that deeper CNNs generalize poorly in the music domain. In this paper, we present a principled way to make deep architectures like ResNet competitive for music-related tasks, based on well-designed regularization strategies. In particular, we analyze the recently introduced Receptive-Field Regularization and Shake-Shake regularization, and show that they significantly improve the generalization of deep CNNs on music-related tasks, and that the resulting deep CNNs can outperform current, more complex models such as CNNs augmented with pre-training and attention. We demonstrate this on two different MIR tasks and two corresponding datasets, thus offering our deep regularized CNNs as a new baseline for these datasets, which can also be used as a feature-extracting module in future, more complex approaches.

research
05/26/2021

Receptive Field Regularization Techniques for Audio Classification and Tagging with Deep Convolutional Neural Networks

In this paper, we study the performance of variants of well-known Convol...
research
10/28/2019

Emotion and Theme Recognition in Music with Frequency-Aware RF-Regularized CNNs

We present CP-JKU submission to MediaEval 2019; a Receptive Field-(RF)-r...
research
09/05/2019

Receptive-field-regularized CNN variants for acoustic scene classification

Acoustic scene classification and related tasks have been dominated by C...
research
07/03/2019

The Receptive Field as a Regularizer in Deep Convolutional Neural Networks for Acoustic Scene Classification

Convolutional Neural Networks (CNNs) have had great success in many mach...
research
12/19/2014

Generative Modeling of Convolutional Neural Networks

The convolutional neural networks (CNNs) have proven to be a powerful to...
research
06/01/2020

Evaluation of CNN-based Automatic Music Tagging Models

Recent advances in deep learning accelerated the development of content-...
research
03/10/2020

Channel Attention with Embedding Gaussian Process: A Probabilistic Methodology

Channel attention mechanisms, as the key components of some modern convo...
