An Empirical Evaluation of End-to-End Polyphonic Optical Music Recognition

08/03/2021
by   Sachinda Edirisooriya, et al.
0

Previous work has shown that neural architectures are able to perform optical music recognition (OMR) on monophonic and homophonic music with high accuracy. However, piano and orchestral scores frequently exhibit polyphonic passages, which add a second dimension to the task. Monophonic and homophonic music can be described as homorhythmic, or having a single musical rhythm. Polyphonic music, on the other hand, can be seen as having multiple rhythmic sequences, or voices, concurrently. We first introduce a workflow for creating large-scale polyphonic datasets suitable for end-to-end recognition from sheet music publicly available on the MuseScore forum. We then propose two novel formulations for end-to-end polyphonic OMR – one treating the problem as a type of multi-task binary classification, and the other treating it as multi-sequence detection. Building upon the encoder-decoder architecture and an image encoder proposed in past work on end-to-end OMR, we propose two novel decoder models – FlagDecoder and RNNDecoder – that correspond to the two formulations. Finally, we compare the empirical performance of these end-to-end approaches to polyphonic OMR and observe a new state-of-the-art performance with our multi-sequence detection decoder, RNNDecoder.

READ FULL TEXT

page 5

page 6

research
10/26/2020

Residual Recurrent CRNN for End-to-End Optical Music Recognition on Monophonic Scores

Optical Music Recognition is a field that attempts to extract digital in...
research
11/20/2017

End-to-end Trained CNN Encode-Decoder Networks for Image Steganography

All the existing image steganography methods use manually crafted featur...
research
08/18/2023

TrOMR:Transformer-Based Polyphonic Optical Music Recognition

Optical Music Recognition (OMR) is an important technology in music and ...
research
07/16/2017

Optical Music Recognition with Convolutional Sequence-to-Sequence Models

Optical Music Recognition (OMR) is an important technology within Music ...
research
11/12/2019

Multi-Step Chord Sequence Prediction Based on Aggregated Multi-Scale Encoder-Decoder Network

This paper studies the prediction of chord progressions for jazz music b...
research
07/15/2022

PoLyScriber: Integrated Training of Extractor and Lyrics Transcriber for Lyrics Transcription in Polyphonic Music

Lyrics transcription of polyphonic music is challenging as the backgroun...
research
09/18/2023

Positive and Risky Message Assessment for Music Products

In this work, we propose a novel research problem: assessing positive an...

Please sign up or login with your details

Forgot password? Click here to reset