Sample-level Deep Convolutional Neural Networks for Music Auto-tagging Using Raw Waveforms

03/06/2017
by   Jongpil Lee, et al.
0

Recently, the end-to-end approach that learns hierarchical representations from raw data using deep convolutional neural networks has been successfully explored in the image, text and speech domains. This approach was applied to musical signals as well but has been not fully explored yet. To this end, we propose sample-level deep convolutional neural networks which learn representations from very small grains of waveforms (e.g. 2 or 3 samples) beyond typical frame-level input representations. Our experiments show how deep architectures with sample-level filters improve the accuracy in music auto-tagging and they provide results comparable to previous state-of-the-art performances for the Magnatagatune dataset and Million Song Dataset. In addition, we visualize filters learned in a sample-level DCNN in each layer to identify hierarchically learned features and show that they are sensitive to log-scaled frequency along layer, such as mel-frequency spectrogram that is widely used in music classification systems.

READ FULL TEXT
research
10/28/2017

Sample-level CNN Architectures for Music Auto-tagging Using Raw Waveforms

Recent work has shown that the end-to-end approach using convolutional n...
research
11/12/2019

Music Auto-tagging Using CNNs and Mel-spectrograms With Reduced Frequency and Time Resolution

Automatic tagging of music is an important research topic in Music Infor...
research
05/25/2021

A Modulation Front-End for Music Audio Tagging

Convolutional Neural Networks have been extensively explored in the task...
research
04/08/2018

Learning-based Video Motion Magnification

Video motion magnification techniques allow us to see small motions prev...
research
09/07/2017

Basic Filters for Convolutional Neural Networks Applied to Music: Training or Design?

When convolutional neural networks are used to tackle learning problems ...
research
04/22/2018

Tempo-Invariant Processing of Rhythm with Convolutional Neural Networks

Rhythm patterns can be performed with a wide variation of tempi. This pr...
research
11/07/2017

End-to-end learning for music audio tagging at scale

The lack of data tends to limit the outcomes of deep learning research -...

Please sign up or login with your details

Forgot password? Click here to reset