Rhythm Transcription of Polyphonic Piano Music Based on Merged-Output HMM for Multiple Voices

01/29/2017
by Eita Nakamura, et al.

In a recent conference paper, we reported a rhythm transcription method based on a merged-output hidden Markov model (HMM) that explicitly describes the multiple-voice structure of polyphonic music. This model solves a major problem of conventional methods, which cannot properly describe multiple voices, as found in polyrhythmic scores or in loose synchrony between voices. In this paper, we present a complete description of the proposed model and develop an inference technique that is valid for any merged-output HMM whose output probabilities depend on past events. We also examine how the architecture and parameters of the method affect the accuracy of rhythm transcription and voice separation, and we perform comparative evaluations with six other algorithms. Using MIDI recordings of classical piano pieces, we found that the proposed model outperformed the other methods by more than 12 points in accuracy for polyrhythmic performances and performed almost as well as the best one for non-polyrhythmic performances. This comparison identifies the state-of-the-art methods of rhythm transcription for the first time in the literature. Publicly available source code is also provided for future comparisons.
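To illustrate why polyrhythmic scores are hard for a model that treats all notes as one voice, the following minimal Python sketch merges the onsets of two voices into a single time-ordered sequence. It is not the merged-output HMM itself, only a toy view of the data such a model has to explain. The function names `voice_onsets` and `merge_voices` and the 3-against-2 example are assumptions made for illustration and do not come from the paper.

```python
from fractions import Fraction
import heapq


def voice_onsets(note_values, start=Fraction(0)):
    """Return the score onset times (in beats) of one voice from its note values."""
    onsets, t = [], start
    for nv in note_values:
        onsets.append(t)
        t += nv
    return onsets


def merge_voices(voices):
    """Merge per-voice onset lists into one time-ordered list of (onset, voice)."""
    # Each per-voice list is already sorted, so heapq.merge yields a sorted result.
    tagged = [[(t, v) for t in onsets] for v, onsets in enumerate(voices)]
    return list(heapq.merge(*tagged))


# Hypothetical 3-against-2 polyrhythm over one beat (illustrative data only).
triplets = voice_onsets([Fraction(1, 3)] * 3)  # voice 0: three notes per beat
duplets = voice_onsets([Fraction(1, 2)] * 2)   # voice 1: two notes per beat

merged = merge_voices([triplets, duplets])

# Inter-onset intervals of the merged sequence, ignoring voice labels.
iois = [b[0] - a[0] for a, b in zip(merged, merged[1:])]
```

In this toy case each voice uses a single note value (1/3 or 1/2 of a beat), yet the merged sequence contains intervals of 1/6 of a beat that belong to neither voice. A model that keeps a separate state per voice can describe each voice with its own simple note values, which is the intuition behind modelling the voices explicitly.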


