Optical Music Recognition with Convolutional Sequence-to-Sequence Models

07/16/2017
by   Eelco van der Wel, et al.
0

Optical Music Recognition (OMR) is an important technology within Music Information Retrieval. Deep learning models show promising results on OMR tasks, but symbol-level annotated data sets of sufficient size to train such models are not available and difficult to develop. We present a deep learning architecture called a Convolutional Sequence-to-Sequence model to both move towards an end-to-end trainable OMR pipeline, and apply a learning process that trains on full sentences of sheet music instead of individually labeled symbols. The model is trained and evaluated on a human generated data set, with various image augmentations based on real-world scenarios. This data set is the first publicly available set in OMR research with sufficient size to train and evaluate deep learning models. With the introduced augmentations a pitch recognition accuracy of 81 resulting in a note level accuracy of 80 commercially available methods, showing a large improvements over these applications.

READ FULL TEXT
research
10/26/2020

Residual Recurrent CRNN for End-to-End Optical Music Recognition on Monophonic Scores

Optical Music Recognition is a field that attempts to extract digital in...
research
08/18/2023

TrOMR:Transformer-Based Polyphonic Optical Music Recognition

Optical Music Recognition (OMR) is an important technology in music and ...
research
02/11/2021

DEEPF0: End-To-End Fundamental Frequency Estimation for Music and Speech Signals

We propose a novel pitch estimation technique called DeepF0, which lever...
research
05/26/2018

Deep Watershed Detector for Music Object Recognition

Optical Music Recognition (OMR) is an important and challenging area wit...
research
08/03/2021

An Empirical Evaluation of End-to-End Polyphonic Optical Music Recognition

Previous work has shown that neural architectures are able to perform op...
research
01/15/2013

Audio Classical Composer Identification by Deep Neural Network

Audio Classical Composer Identification (ACC) is an important problem in...
research
01/11/2022

Region-based Layout Analysis of Music Score Images

The Layout Analysis (LA) stage is of vital importance to the correct per...

Please sign up or login with your details

Forgot password? Click here to reset