An Attention Mechanism for Musical Instrument Recognition

07/09/2019
by   Siddharth Gururani, et al.
0

While the automatic recognition of musical instruments has seen significant progress, the task is still considered hard for music featuring multiple instruments as opposed to single instrument recordings. Datasets for polyphonic instrument recognition can be categorized into roughly two categories. Some, such as MedleyDB, have strong per-frame instrument activity annotations but are usually small in size. Other, larger datasets such as OpenMIC only have weak labels, i.e., instrument presence or absence is annotated only for long snippets of a song. We explore an attention mechanism for handling weakly labeled data for multi-label instrument recognition. Attention has been found to perform well for other tasks with weakly labeled data. We compare the proposed attention model to multiple models which include a baseline binary relevance random forest, recurrent neural network, and fully connected neural networks. Our results show that incorporating attention leads to an overall improvement in classification accuracy metrics across all 20 instruments in the OpenMIC dataset. We find that attention enables models to focus on (or `attend to') specific time segments in the audio relevant to each instrument label leading to interpretable results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/17/2020

Visual Attention for Musical Instrument Recognition

In the field of music information retrieval, the task of simultaneously ...
research
01/24/2020

Learning Multi-instrument Classification with Partial Labels

Multi-instrument recognition is the task of predicting the presence or a...
research
06/25/2018

Frame-level Instrument Recognition by Timbre and Pitch

Instrument recognition is a fundamental task in music information retrie...
research
11/03/2018

Multitask learning for frame-level instrument recognition

For many music analysis problems, we need to know the presence of instru...
research
06/18/2018

Towards multi-instrument drum transcription

Automatic drum transcription, a subtask of the more general automatic mu...
research
07/13/2021

Timbre Classification of Musical Instruments with a Deep Learning Multi-Head Attention-Based Model

The aim of this work is to define a model based on deep learning that is...
research
05/05/2018

Weakly-supervised Visual Instrument-playing Action Detection in Videos

Instrument playing is among the most common scenes in music-related vide...

Please sign up or login with your details

Forgot password? Click here to reset