Score-informed syllable segmentation for a cappella singing voice with convolutional neural networks

07/12/2017
by   Jordi Pons, et al.
0

This paper introduces a new score-informed method for the segmentation of jingju a cappella singing phrase into syllables. The proposed method estimates the most likely sequence of syllable boundaries given the estimated syllable onset detection function (ODF) and its score. Throughout the paper, we first examine the jingju syllables structure and propose a definition of the term "syllable onset". Then, we identify which are the challenges that jingju a cappella singing poses. Further, we investigate how to improve the syllable ODF estimation with convolutional neural networks (CNNs). We propose a novel CNN architecture that allows to efficiently capture different time-frequency scales for estimating syllable onsets. In addition, we propose using a score-informed Viterbi algorithm -instead of thresholding the onset function-, because the available musical knowledge we have (the score) can be used to inform the Viterbi algorithm in order to overcome the identified challenges. The proposed method outperforms the state-of-the-art in syllable segmentation for jingju a cappella singing. We further provide an analysis of the segmentation errors which points possible research directions.

READ FULL TEXT

page 2

page 3

page 4

research
06/05/2018

Singing voice phoneme segmentation by hierarchically inferring syllable and phoneme onset positions

In this paper, we tackle the singing voice phoneme segmentation problem ...
research
10/24/2019

Fast and High-Quality Singing Voice Synthesis System based on Convolutional Neural Networks

The present paper describes singing voice synthesis based on convolution...
research
08/01/2020

Score-informed Networks for Music Performance Assessment

The assessment of music performances in most cases takes into account th...
research
04/15/2019

Singing voice synthesis based on convolutional neural networks

The present paper describes a singing voice synthesis based on convoluti...
research
09/09/2020

Multiple F0 Estimation in Vocal Ensembles using Convolutional Neural Networks

This paper addresses the extraction of multiple F0 values from polyphoni...
research
05/31/2017

3D Mesh Segmentation via Multi-branch 1D Convolutional Neural Networks

3D mesh segmentation is an important research area in computer graphics,...

Please sign up or login with your details

Forgot password? Click here to reset