Gaussian Processes for Music Audio Modelling and Content Analysis

06/03/2016
by   Pablo A. Alvarado, et al.
0

Real music signals are highly variable, yet they have strong statistical structure. Prior information about the underlying physical mechanisms by which sounds are generated and rules by which complex sound structure is constructed (notes, chords, a complete musical score), can be naturally unified using Bayesian modelling techniques. Typically algorithms for Automatic Music Transcription independently carry out individual tasks such as multiple-F0 detection and beat tracking. The challenge remains to perform joint estimation of all parameters. We present a Bayesian approach for modelling music audio, and content analysis. The proposed methodology based on Gaussian processes seeks joint estimation of multiple music concepts by incorporating into the kernel prior information about non-stationary behaviour, dynamics, and rich spectral content present in the modelled music signal. We illustrate the benefits of this approach via two tasks: pitch estimation, and inferring missing segments in a polyphonic audio recording.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/01/2021

Audio Content Analysis

Preprint for a book chapter introducing Audio Content Analysis. With a f...
research
05/19/2017

Efficient Learning of Harmonic Priors for Pitch Detection in Polyphonic Music

Automatic music transcription (AMT) aims to infer a latent symbolic repr...
research
10/26/2019

A holistic approach to polyphonic music transcription with neural networks

We present a framework based on neural networks to extract music scores ...
research
11/11/2018

PerformanceNet: Score-to-Audio Music Generation with Multi-Band Convolutional Residual Network

Music creation is typically composed of two parts: composing the musical...
research
07/22/2016

Similarity graphs for the concealment of long duration data loss in music

We present a novel method for the compensation of long duration data gap...
research
06/30/2022

libACA, pyACA, and ACA-Code: Audio Content Analysis in 3 Languages

The three packages libACA, pyACA, and ACA-Code provide reference impleme...
research
12/06/2022

FretNet: Continuous-Valued Pitch Contour Streaming for Polyphonic Guitar Tablature Transcription

In recent years, the task of Automatic Music Transcription (AMT), whereb...

Please sign up or login with your details

Forgot password? Click here to reset