Multi-task Learning with Metadata for Music Mood Classification

10/10/2021
by Rajnish Kumar, et al.

Mood recognition is an important problem in music informatics, with key applications in music discovery and recommendation that have become even more relevant with the rise of music streaming. We investigate whether readily available audio metadata, such as artist and year, can be leveraged to improve the performance of mood classification models. To this end, we propose a multi-task learning approach in which a shared model is trained simultaneously on mood and metadata prediction tasks, with the goal of learning richer representations. Experimentally, we demonstrate that applying our technique to existing state-of-the-art convolutional neural networks for mood classification consistently improves their performance. We conduct experiments on multiple datasets and report improvements in average precision of up to 8.7 points.
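As a rough illustration of the shared-model idea described in the abstract, the following sketch computes a joint loss for one track from a shared hidden layer feeding two task heads. It is a minimal sketch under stated assumptions, not the authors' implementation: the small dense encoder stands in for the convolutional networks mentioned above, mood is treated as a multi-label target, artist is used as the single metadata task, and the names `init_params`, `multitask_loss`, and `aux_weight` are hypothetical. The abstract does not state how the two task losses are combined, so the weighted sum is also an assumption.

```python
import math
import random


def linear(weights, bias, x):
    """Dense layer: one output per weight row."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def init_layer(rng, n_out, n_in):
    """Small random weights and zero biases for one dense layer."""
    weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


def init_params(n_features, n_hidden, n_moods, n_artists, seed=0):
    """Hypothetical parameter layout: one shared encoder, two task heads."""
    rng = random.Random(seed)
    return {
        "shared": init_layer(rng, n_hidden, n_features),
        "mood_head": init_layer(rng, n_moods, n_hidden),
        "artist_head": init_layer(rng, n_artists, n_hidden),
    }


def binary_cross_entropy(logits, targets):
    """Mean multi-label cross-entropy on raw logits (numerically stable form)."""
    total = 0.0
    for z, t in zip(logits, targets):
        total += max(z, 0.0) - t * z + math.log1p(math.exp(-abs(z)))
    return total / len(logits)


def softmax_cross_entropy(logits, target_index):
    """Single-label cross-entropy on raw logits via log-sum-exp."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[target_index]


def multitask_loss(params, features, mood_targets, artist_index, aux_weight=0.5):
    """Joint loss: mood loss plus a weighted metadata (artist) loss.

    Both heads read the same hidden representation, so gradients from the
    metadata task would also shape the shared encoder during training.
    The weighting scheme is an assumption made for this sketch.
    """
    hidden = [max(0.0, h) for h in linear(*params["shared"], features)]  # ReLU
    mood = binary_cross_entropy(
        linear(*params["mood_head"], hidden), mood_targets)
    artist = softmax_cross_entropy(
        linear(*params["artist_head"], hidden), artist_index)
    return mood + aux_weight * artist, mood, artist


# Usage: one track with 4 audio features, 2 mood tags, 5 candidate artists.
params = init_params(n_features=4, n_hidden=3, n_moods=2, n_artists=5, seed=1)
total, mood, artist = multitask_loss(
    params, [0.1, -0.2, 0.3, 0.4], mood_targets=[1, 0], artist_index=2)
print(f"total={total:.4f} mood={mood:.4f} artist={artist:.4f}")
```

Setting `aux_weight=0.0` reduces the objective to plain single-task mood classification, which gives a convenient baseline when comparing against the joint setup.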


