A Deep Bag-of-Features Model for Music Auto-Tagging

08/20/2015
by Juhan Nam, et al.

Feature learning and deep learning have drawn great attention in recent years as ways of transforming input data into more effective representations using learning algorithms. This interest has also grown in music information retrieval (MIR), particularly in music audio classification tasks such as auto-tagging. In this paper, we present a two-stage learning model that predicts multiple labels from music audio. The first stage learns, in an unsupervised manner, to project local spectral patterns of an audio track onto a high-dimensional sparse space and summarizes the track as a bag-of-features. The second stage applies unsupervised learning to the bag-of-features layer by layer to initialize a deep neural network, then fine-tunes the network with the tag labels. In our experiments, we rigorously examine training choices and tuning parameters, and show that the model achieves high performance on Magnatagatune, a widely used dataset for music auto-tagging.
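The first stage described in the abstract can be illustrated with a rough sketch. The code below is not the authors' implementation. It assumes NumPy, uses a random unit-norm dictionary in place of a learned one, uses soft-threshold encoding as a stand-in for the unsupervised learning algorithm, and assumes max-pooling over fixed-size segments followed by averaging as the track-level summary. All function names, sizes, and the threshold value are hypothetical.

```python
import numpy as np


def encode_patches(patches, dictionary, threshold=1.0):
    """Project local spectral patches onto a sparse high-dimensional space.

    Soft-threshold encoding is an assumed stand-in for a learned encoder.
    patches: (n_patches, patch_dim), dictionary: (n_features, patch_dim).
    """
    activations = patches @ dictionary.T  # (n_patches, n_features)
    # Values at or below the threshold become zero, which makes the code sparse.
    return np.maximum(activations - threshold, 0.0)


def bag_of_features(sparse_codes, pool_size):
    """Summarize a track: max-pool within segments, then average the segments.

    The pooling scheme is an assumption made for this illustration.
    """
    n_segments = len(sparse_codes) // pool_size
    trimmed = sparse_codes[: n_segments * pool_size]  # drop the ragged tail
    pooled = trimmed.reshape(n_segments, pool_size, -1).max(axis=1)
    return pooled.mean(axis=0)  # one vector per track


rng = np.random.default_rng(0)
patches = rng.standard_normal((200, 64))     # hypothetical spectral patches
dictionary = rng.standard_normal((512, 64))  # random stand-in dictionary
dictionary /= np.linalg.norm(dictionary, axis=1, keepdims=True)

codes = encode_patches(patches, dictionary)
track_vector = bag_of_features(codes, pool_size=20)
print(track_vector.shape)  # (512,)
```

In the model the abstract describes, a track-level vector of this kind would be the input to the second stage, where a deep neural network is pretrained layer by layer and then fine-tuned with the tag labels.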

research
10/18/2017

Representation Learning of Music Using Artist Labels

Recently, feature representation by learning algorithms has drawn great ...
research
01/30/2021

Melon Playlist Dataset: a public dataset for audio-based playlist generation and music tagging

One of the main limitations in the field of audio signal processing is t...
research
10/17/2021

Deep Learning Based EDM Subgenre Classification using Mel-Spectrogram and Tempogram Features

Along with the evolution of music technology, a large number of styles, ...
research
03/06/2017

Multi-Level and Multi-Scale Feature Aggregation Using Pre-trained Convolutional Neural Networks for Music Auto-tagging

Music auto-tagging is often handled in a similar manner to image classif...
research
04/05/2017

Revisiting the problem of audio-based hit song prediction using convolutional neural networks

Being able to predict whether a song can be a hit has important applic...
research
08/31/2020

Detecting Generic Music Features with Single Layer Feedforward Network using Unsupervised Hebbian Computation

With the ever-increasing number of digital music and vast music track fe...
research
07/23/2018

Auto-adaptive Resonance Equalization using Dilated Residual Networks

In music and audio production, attenuation of spectral resonances is an ...