Frame-Level Multi-Label Playing Technique Detection Using Multi-Scale Network and Self-Attention Mechanism

03/23/2023
by   Dichucheng Li, et al.

Instrument playing technique (IPT) is a key element of musical presentation. However, most existing work on IPT detection concerns only monophonic music signals; little has been done to detect IPTs in polyphonic instrumental solo pieces with overlapping or mixed IPTs. In this paper, we formulate IPT detection as a frame-level multi-label classification problem and apply it to the Guzheng, a Chinese plucked string instrument. We create a new dataset, Guzheng_Tech99, containing Guzheng recordings with onset, offset, pitch, and IPT annotations for each note. Because IPTs vary widely in length, we propose a new method that combines a multi-scale network with self-attention. The multi-scale network extracts features at different scales, and the self-attention mechanism, applied to the feature maps at the coarsest scale, further enhances long-range feature extraction. Our approach outperforms existing works by a large margin, indicating its effectiveness in IPT detection.
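
To make the frame-level multi-label formulation concrete, here is a minimal sketch of how note-level annotations (onset, offset, IPT) might be turned into per-frame multi-hot targets, so that overlapping techniques activate several labels in the same frame. The function name, the frame rate, and the technique names are assumptions chosen for illustration; the abstract does not state the paper's frame resolution or label set.

```python
import math


def notes_to_frame_labels(notes, techniques, duration, frame_rate):
    """Build a frame-level multi-hot label matrix from note annotations.

    notes:      iterable of (onset_sec, offset_sec, technique_name)
    techniques: ordered list of technique names (defines the column order)
    duration:   clip length in seconds
    frame_rate: frames per second (hypothetical value, not from the paper)
    """
    n_frames = math.ceil(duration * frame_rate)
    column = {name: i for i, name in enumerate(techniques)}
    labels = [[0] * len(techniques) for _ in range(n_frames)]
    for onset, offset, name in notes:
        # Mark every frame that the note overlaps, clipped to the clip length.
        first = max(0, math.floor(onset * frame_rate))
        last = min(n_frames, math.ceil(offset * frame_rate))
        for t in range(first, last):
            labels[t][column[name]] = 1
    return labels


# Hypothetical technique names and annotations, for illustration only.
techniques = ["vibrato", "glissando", "tremolo"]
notes = [
    (0.0, 0.5, "vibrato"),
    (0.25, 1.0, "tremolo"),  # overlaps the vibrato note
]
labels = notes_to_frame_labels(notes, techniques, duration=1.0, frame_rate=4)

for t, row in enumerate(labels):
    print(t, row)
# Frame 1 is [1, 0, 1]: two techniques are active at once, which is why
# each frame needs an independent yes/no decision per technique.
```

With targets in this shape, a model would typically emit one score per technique per frame and be trained with an independent binary loss on each label, rather than a single softmax over techniques.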

research
09/19/2022

Playing Technique Detection by Fusing Note Onset Information in Guzheng Performance

The Guzheng is a kind of traditional Chinese instruments with diverse pl...
research
08/09/2023

PAT: Position-Aware Transformer for Dense Multi-Label Action Detection

We present PAT, a transformer-based network that learns complex temporal...
research
02/13/2022

DEEPCHORUS: A Hybrid Model of Multi-scale Convolution and Self-attention for Chorus Detection

Chorus detection is a challenging problem in musical signal processing a...
research
09/03/2021

Musical Tempo Estimation Using a Multi-scale Network

Recently, some single-step systems without onset detection have shown th...
research
03/03/2022

ViTransPAD: Video Transformer using convolution and self-attention for Face Presentation Attack Detection

Face Presentation Attack Detection (PAD) is an important measure to prev...
research
07/24/2023

MFMAN-YOLO: A Method for Detecting Pole-like Obstacles in Complex Environment

In real-world traffic, there are various uncertainties and complexities ...
research
02/16/2023

An Attention-based Approach to Hierarchical Multi-label Music Instrument Classification

Although music is typically multi-label, many works have studied hierarc...
