End-to-end learning for music audio tagging at scale

11/07/2017
by   Jordi Pons, et al.

The lack of data tends to limit the outcomes of deep learning research, especially when dealing with end-to-end learning stacks that process raw data such as waveforms. In this study we make use of musical labels annotated for 1.2 million tracks. This large amount of data allows us to explore different front-end paradigms without restriction: from assumption-free models, which take waveforms as input and use very small convolutional filters, to models that rely on domain knowledge, which take log-mel spectrograms as input and use a convolutional neural network designed to learn temporal and timbral features. Results suggest that while spectrogram-based models surpass their waveform-based counterparts, the difference in performance shrinks as more data are employed.
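To make the two front-end paradigms concrete, here is a minimal NumPy sketch of what each one feeds to the rest of a network. It is an illustration under stated assumptions, not the authors' implementation. The function names, the filter width of 3, the layer and filter counts, the 16 kHz sample rate, the FFT size, the hop size, and the number of mel bands are all hypothetical choices. The random filters stand in for weights that would normally be learned.

```python
import numpy as np

def strided_conv1d(x, filters, stride):
    """1D convolution + ReLU. x: (c_in, time); filters: (c_out, c_in, width)."""
    c_out, c_in, width = filters.shape
    n_frames = (x.shape[1] - width) // stride + 1
    out = np.empty((c_out, n_frames))
    for t in range(n_frames):
        window = x[:, t * stride : t * stride + width]  # (c_in, width)
        out[:, t] = np.tensordot(filters, window, axes=([1, 2], [0, 1]))
    return np.maximum(out, 0.0)

def waveform_frontend(x, n_layers=3, n_filters=8, width=3, seed=0):
    """Assumption-free front end: stacked very small filters on raw samples.
    Random filters are placeholders for learned ones (illustrative only)."""
    rng = np.random.default_rng(seed)
    h = x[np.newaxis, :]  # (1, time): a single input channel
    for _ in range(n_layers):
        filters = 0.1 * rng.standard_normal((n_filters, h.shape[0], width))
        h = strided_conv1d(h, filters, stride=width)
    return h  # (n_filters, time / width**n_layers, roughly)

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_mels, n_fft, sr):
    """Triangular filters with centres equally spaced on the mel scale."""
    freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    edges = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2))
    fb = np.zeros((n_mels, freqs.size))
    for m in range(n_mels):
        lo, ctr, hi = edges[m], edges[m + 1], edges[m + 2]
        rising = (freqs - lo) / (ctr - lo)
        falling = (hi - freqs) / (hi - ctr)
        fb[m] = np.maximum(0.0, np.minimum(rising, falling))
    return fb

def log_mel_frontend(x, sr, n_fft=512, hop=256, n_mels=96):
    """Domain-knowledge front end: a log-compressed mel spectrogram."""
    n_frames = (x.size - n_fft) // hop + 1
    window = np.hanning(n_fft)
    frames = np.stack([x[i * hop : i * hop + n_fft] * window for i in range(n_frames)])
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2  # (n_frames, n_fft // 2 + 1)
    mel = power @ mel_filterbank(n_mels, n_fft, sr).T  # (n_frames, n_mels)
    return np.log1p(mel).T  # (n_mels, n_frames)

# One second of noise as a stand-in for a music excerpt.
sr = 16000
x = np.random.default_rng(1).standard_normal(sr)
print(waveform_frontend(x).shape)     # (filters, time steps)
print(log_mel_frontend(x, sr).shape)  # (mel bands, frames)
```

In the waveform branch nothing about frequency or perception is built in, so any useful filterbank has to be learned from data. The log-mel branch fixes that structure in advance. This is the trade-off the abstract refers to when it reports that the gap narrows as more training data become available.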
