MATT: A Multiple-instance Attention Mechanism for Long-tail Music Genre Classification

09/09/2022
by   Xiaokai Liu, et al.
0

Imbalanced music genre classification is a crucial task in the Music Information Retrieval (MIR) field for identifying the long-tail, data-poor genre based on the related music audio segments, which is very prevalent in real-world scenarios. Most of the existing models are designed for class-balanced music datasets, resulting in poor performance in accuracy and generalization when identifying the music genres at the tail of the distribution. Inspired by the success of introducing Multi-instance Learning (MIL) in various classification tasks, we propose a novel mechanism named Multi-instance Attention (MATT) to boost the performance for identifying tail classes. Specifically, we first construct the bag-level datasets by generating the album-artist pair bags. Second, we leverage neural networks to encode the music audio segments. Finally, under the guidance of a multi-instance attention mechanism, the neural network-based models could select the most informative genre to match the given music segment. Comprehensive experimental results on a large-scale music genre benchmark dataset with long-tail distribution demonstrate MATT significantly outperforms other state-of-the-art baselines.

READ FULL TEXT

page 1

page 3

research
03/04/2019

Long-tail Relation Extraction via Knowledge Graph Embeddings and Graph Convolution Networks

We propose a distance supervised relation extraction approach for long-t...
research
09/08/2023

A Long-Tail Friendly Representation Framework for Artist and Music Similarity

The investigation of the similarity between artists and music is crucial...
research
03/24/2022

Score difficulty analysis for piano performance education based on fingering

In this paper, we introduce score difficulty classification as a sub-tas...
research
06/26/2019

Learning Soft-Attention Models for Tempo-invariant Audio-Sheet Music Retrieval

Connecting large libraries of digitized audio recordings to their corres...
research
08/25/2022

A Study on Broadcast Networks for Music Genre Classification

Due to the increased demand for music streaming/recommender services and...
research
05/17/2023

Characterizing Long-Tail Categories on Graphs

Long-tail data distributions are prevalent in many real-world networks, ...
research
09/15/2018

Attention as a Perspective for Learning Tempo-invariant Audio Queries

Current models for audio--sheet music retrieval via multimodal embedding...

Please sign up or login with your details

Forgot password? Click here to reset