A Phoneme-Informed Neural Network Model for Note-Level Singing Transcription

04/12/2023
by   Sangeon Yong, et al.
0

Note-level automatic music transcription is one of the most representative music information retrieval (MIR) tasks and has been studied for various instruments to understand music. However, due to the lack of high-quality labeled data, transcription of many instruments is still a challenging task. In particular, in the case of singing, it is difficult to find accurate notes due to its expressiveness in pitch, timbre, and dynamics. In this paper, we propose a method of finding note onsets of singing voice more accurately by leveraging the linguistic characteristics of singing, which are not seen in other instruments. The proposed model uses mel-scaled spectrogram and phonetic posteriorgram (PPG), a frame-wise likelihood of phoneme, as an input of the onset detection network while PPG is generated by the pre-trained network with singing and speech data. To verify how linguistic features affect onset detection, we compare the evaluation results through the dataset with different languages and divide onset types for detailed analysis. Our approach substantially improves the performance of singing transcription and therefore emphasizes the importance of linguistic features in singing analysis.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/05/2020

From Note-Level to Chord-Level Neural Network Models for Voice Separation in Symbolic Music

Music is often experienced as a progression of concurrent streams of not...
research
07/19/2017

Metrical-accent Aware Vocal Onset Detection in Polyphonic Audio

The goal of this study is the automatic detection of onsets of the singi...
research
06/25/2018

Frame-level Instrument Recognition by Timbre and Pitch

Instrument recognition is a fundamental task in music information retrie...
research
04/30/2023

Transfer of knowledge among instruments in automatic music transcription

Automatic music transcription (AMT) is one of the most challenging tasks...
research
08/11/2020

Transfer Learning for Improving Singing-voice Detection in Polyphonic Instrumental Music

Detecting singing-voice in polyphonic instrumental music is critical to ...
research
07/13/2021

Towards Automatic Instrumentation by Learning to Separate Parts in Symbolic Multitrack Music

Modern keyboards allow a musician to play multiple instruments at the sa...
research
11/15/2022

Music Instrument Classification Reprogrammed

The performance of approaches to Music Instrument Classification, a popu...

Please sign up or login with your details

Forgot password? Click here to reset