An Attention-based Approach to Hierarchical Multi-label Music Instrument Classification

02/16/2023
by Zhi Zhong, et al.

Although music is typically multi-label, many works have studied hierarchical music tagging under simplified settings such as single-label data. Moreover, a framework for describing the various joint training methods under the multi-label setting is lacking. To address these issues, we introduce the hierarchical multi-label music instrument classification task. The task provides a realistic setting in which real, multi-instrument music data is assumed. Various hierarchical methods that jointly train a DNN are summarized and explored in the context of fusing deep learning with conventional techniques. For effective joint training in the multi-label setting, we propose two methods to model the connection between fine- and coarse-level tags: one uses rule-based grouped max-pooling, and the other uses an attention mechanism obtained in a data-driven manner. Our evaluation reveals that the proposed methods have advantages over the method without joint training. In addition, the decision procedure within the proposed methods can be interpreted by visualizing attention maps or referring to the fixed rules.
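
The two ways of connecting fine- and coarse-level tags can be illustrated with a minimal sketch. It is not the authors' implementation: the tag names, the grouping, the function names, and the attention scores are all hypothetical, and in the paper the attention is learned from data rather than supplied by hand. The first function takes the maximum fine-level probability within each fixed group; the second replaces the fixed rule with a softmax-weighted sum over all fine-level tags.

```python
import math

# Hypothetical fine-level tags and coarse-level groups (illustrative only,
# not the taxonomy used in the paper).
FINE_TAGS = ["violin", "cello", "trumpet", "trombone", "snare"]
GROUPS = {
    "strings": ["violin", "cello"],
    "brass": ["trumpet", "trombone"],
    "percussion": ["snare"],
}


def grouped_max_pool(fine_probs, fine_tags=FINE_TAGS, groups=GROUPS):
    """Rule-based pooling: a coarse tag is as likely as its most likely member."""
    by_tag = dict(zip(fine_tags, fine_probs))
    return {
        coarse: max(by_tag[tag] for tag in members)
        for coarse, members in groups.items()
    }


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(fine_probs, scores_per_coarse):
    """Attention-style pooling: each coarse tag is a weighted sum of fine probabilities.

    scores_per_coarse maps a coarse tag to one unnormalized score per fine tag.
    Here the scores are passed in by hand; a trained model would produce them.
    """
    pooled = {}
    for coarse, scores in scores_per_coarse.items():
        weights = softmax(scores)
        pooled[coarse] = sum(w * p for w, p in zip(weights, fine_probs))
    return pooled


fine_probs = [0.9, 0.2, 0.1, 0.3, 0.05]
coarse_by_rule = grouped_max_pool(fine_probs)
coarse_by_attention = attention_pool(fine_probs, {"strings": [0.0] * 5})
```

With the fixed rule, the "strings" score is simply the larger of the violin and cello probabilities, so the decision can be traced back to the rule itself. With attention, the weights play that explanatory role; uniform scores, as in the last line, reduce the pooling to a plain average over all fine-level tags.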

