Learnable Front Ends Based on Temporal Modulation for Music Tagging

11/28/2022
by   Yinghao Ma, et al.
0

While end-to-end systems are becoming popular in auditory signal processing including automatic music tagging, models using raw audio as input needs a large amount of data and computational resources without domain knowledge. Inspired by the fact that temporal modulation is regarded as an essential component in auditory perception, we introduce the Temporal Modulation Neural Network (TMNN) that combines Mel-like data-driven front ends and temporal modulation filters with a simple ResNet back end. The structure includes a set of temporal modulation filters to capture long-term patterns in all frequency channels. Experimental results show that the proposed front ends surpass state-of-the-art (SOTA) methods on the MagnaTagATune dataset in automatic music tagging, and they are also helpful for keyword spotting on speech commands. Moreover, the model performance for each tag suggests that genre or instrument tags with complex rhythm and mood tags can especially be improved with temporal modulation.

READ FULL TEXT
research
05/25/2021

A Modulation Front-End for Music Audio Tagging

Convolutional Neural Networks have been extensively explored in the task...
research
11/07/2017

End-to-end learning for music audio tagging at scale

The lack of data tends to limit the outcomes of deep learning research -...
research
06/23/2023

Modulation Graphs in Popular Music

In this paper, graph theory is used to explore the musical notion of ton...
research
05/30/2018

Progressive Evaluation of Queries over Untagged Data

Modern information systems often collect raw data in the form of text, i...
research
05/30/2018

Progressive Evaluation of Queries over Tagged Data

Modern information systems often collect raw data in the form of text, i...
research
06/07/2017

The Effects of Noisy Labels on Deep Convolutional Neural Networks for Music Tagging

Deep neural networks (DNN) have been successfully applied to music class...
research
03/24/2022

Data-Driven Visual Reflection on Music Instrument Practice

We propose a data-driven approach to music instrument practice that allo...

Please sign up or login with your details

Forgot password? Click here to reset