TrOMR: Transformer-Based Polyphonic Optical Music Recognition

08/18/2023
by Yixuan Li, et al.

Optical Music Recognition (OMR) is an important technology in music and has long been an active research topic. Previous OMR approaches are usually based on a CNN for image understanding and an RNN for music symbol classification. In this paper, we propose TrOMR, a transformer-based approach with excellent global perceptual capability for end-to-end polyphonic OMR. We also introduce a novel consistency loss function and a reasonable approach to data annotation to improve recognition accuracy on complex music scores. Extensive experiments demonstrate that TrOMR outperforms current OMR methods, especially in real-world scenarios. We also develop a TrOMR system and build a camera-scene dataset of full-page music scores in real-world settings. The code and datasets will be made available for reproducibility.


