A holistic approach to polyphonic music transcription with neural networks

10/26/2019
by   Miguel A. Román, et al.
0

We present a framework based on neural networks to extract music scores directly from polyphonic audio in an end-to-end fashion. Most previous Automatic Music Transcription (AMT) methods seek a piano-roll representation of the pitches, that can be further transformed into a score by incorporating tempo estimation, beat tracking, key estimation or rhythm quantization. Unlike these methods, our approach generates music notation directly from the input audio in a single stage. For this, we use a Convolutional Recurrent Neural Network (CRNN) with Connectionist Temporal Classification (CTC) loss function which does not require annotated alignments of audio frames with the score rhythmic information. We trained our model using as input Haydn, Mozart, and Beethoven string quartets and Bach chorales synthesized with different tempos and expressive performances. The output is a textual representation of four-voice music scores based on **kern format. Although the proposed approach is evaluated in a simplified scenario, results show that this model can learn to transcribe scores directly from audio signals, opening a promising avenue towards complete AMT.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/26/2020

Residual Recurrent CRNN for End-to-End Optical Music Recognition on Monophonic Scores

Optical Music Recognition is a field that attempts to extract digital in...
research
11/13/2017

Audio-to-score alignment of piano music using RNN-based automatic music transcription

We propose a framework for audio-to-score alignment on piano performance...
research
05/26/2016

Robust Downbeat Tracking Using an Ensemble of Convolutional Networks

In this paper, we present a novel state of the art system for automatic ...
research
06/03/2016

Gaussian Processes for Music Audio Modelling and Content Analysis

Real music signals are highly variable, yet they have strong statistical...
research
11/13/2020

A Comprehensive Survey on Deep Music Generation: Multi-level Representations, Algorithms, Evaluations, and Future Directions

The utilization of deep learning techniques in generating various conten...
research
01/14/2019

Music Artist Classification with Convolutional Recurrent Neural Networks

Previous attempts at music artist classification use frame-level audio f...
research
07/03/2018

Weakly Supervised Deep Recurrent Neural Networks for Basic Dance Step Generation

A deep recurrent neural network with audio input is applied to model bas...

Please sign up or login with your details

Forgot password? Click here to reset