Polyphonic Pitch Tracking with Deep Layered Learning

04/09/2018
by   Anders Elowsson, et al.
0

This paper presents a polyphonic pitch tracking system able to extract both framewise and note-based estimates from audio. The system uses six artificial neural networks in a deep layered learning setup. First, cascading networks are applied to a spectrogram for framewise fundamental frequency (f0) estimation. A sparse receptive field is learned by the first network and then used for weight-sharing throughout the system. The f0-activations are connected across time to extract pitch ridges. These ridges define a framework, within which subsequent networks perform tone-shift-invariant onset and offset detection. The networks convolve the pitch ridges across time, using as input, e.g., variations of latent representations from the f0 estimation networks, defined as the "neural flux." Finally, incorrect tentative notes are removed one by one in an iterative procedure that allows a network to classify notes within an accurate context. The system was evaluated on four public datasets (MAPS, Bach10, Trios, and the MIREX Woodwind quintet), and performed state-of-the-art results for all four datasets. The system performs well across all subtasks: f0, pitched onset, and pitched offset tracking.

READ FULL TEXT

page 22

page 31

research
04/20/2020

Colonoscope tracking method based on shape estimation network

This paper presents a colonoscope tracking method utilizing a colon shap...
research
04/22/2018

Tempo-Invariant Processing of Rhythm with Convolutional Neural Networks

Rhythm patterns can be performed with a wide variation of tempi. This pr...
research
10/30/2016

Feature-Augmented Neural Networks for Patient Note De-identification

Patient notes contain a wealth of information of potentially great inter...
research
06/28/2018

GenerationMania: Learning to Semantically Choreograph

Beatmania is a rhythm action game where players play the role of a DJ th...
research
04/13/2005

Self-Organizing Multilayered Neural Networks of Optimal Complexity

The principles of self-organizing the neural networks of optimal complex...
research
02/27/2022

Hierarchical Linear Dynamical System for Representing Notes from Recorded Audio

We seek to develop simultaneous segmentation and classification of notes...
research
09/04/2019

Towards Interpretable Polyphonic Transcription with Invertible Neural Networks

We explore a novel way of conceptualising the task of polyphonic music t...

Please sign up or login with your details

Forgot password? Click here to reset