Audio-to-score alignment of piano music using RNN-based automatic music transcription

11/13/2017
by Taegyun Kwon, et al.

We propose a framework for audio-to-score alignment of piano performances that employs automatic music transcription (AMT) using neural networks. Although the AMT result may contain errors, the note prediction output can be regarded as a learned feature representation that is directly comparable to a MIDI note or chroma representation. To this end, we employ two recurrent neural networks that serve as AMT-based feature extractors for the alignment algorithm. One predicts the presence of 88 notes or 12 chroma at the frame level, and the other detects note onsets in 12 chroma. We combine the two types of learned features for audio-to-score alignment. For comparability, we apply dynamic time warping as the alignment algorithm without any additional post-processing. We evaluate the proposed framework on the MAPS dataset and compare it to previous work. The results show that the alignment framework with the learned features significantly improves accuracy, achieving a mean onset error of less than 10 ms.
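The alignment step described in the abstract can be illustrated with a minimal sketch of plain dynamic time warping over frame-wise feature sequences. This is not the authors' implementation: the function names (`combine_features`, `cosine_cost`, `dtw_path`), the concatenation-based feature combination, the `onset_weight` parameter, and the cosine distance are all assumptions made for illustration, since the abstract does not specify how the two feature types are combined or which local cost is used. The inputs stand in for the outputs of the two networks (12-dimensional chroma activations and 12-dimensional chroma onsets per frame).

```python
import numpy as np


def combine_features(chroma, onset, onset_weight=0.5):
    """Stack frame-wise chroma and chroma-onset features side by side.

    Assumption: simple concatenation with a hypothetical weight; the
    abstract only says the two feature types are combined.
    """
    return np.hstack([chroma, onset_weight * onset])


def cosine_cost(X, Y, eps=1e-9):
    """Pairwise cosine distance between rows of X (n, d) and Y (m, d)."""
    Xn = X / (np.linalg.norm(X, axis=1, keepdims=True) + eps)
    Yn = Y / (np.linalg.norm(Y, axis=1, keepdims=True) + eps)
    return 1.0 - Xn @ Yn.T


def dtw_path(cost):
    """Standard DTW with steps (1,1), (1,0), (0,1); returns the warping path."""
    n, m = cost.shape
    # Accumulated cost with a padded border of infinities.
    acc = np.full((n + 1, m + 1), np.inf)
    acc[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            best_prev = min(acc[i - 1, j - 1], acc[i - 1, j], acc[i, j - 1])
            acc[i, j] = cost[i - 1, j - 1] + best_prev
    # Backtrack from the end to the start along the cheapest predecessors.
    i, j = n, m
    path = [(i - 1, j - 1)]
    while (i, j) != (1, 1):
        candidates = [(i - 1, j - 1), (i - 1, j), (i, j - 1)]
        i, j = min(candidates, key=lambda s: acc[s])
        path.append((i - 1, j - 1))
    return path[::-1]
```

Given audio-side features `A` and score-side features `S` of shapes `(n, d)` and `(m, d)`, `dtw_path(cosine_cost(A, S))` returns a list of `(audio_frame, score_frame)` index pairs. Multiplying the frame indices by the hop duration converts the path into a time mapping, from which onset errors can be measured.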


