Particle Filtering for PLCA model with Application to Music Transcription

by   D. Cazau, et al.

Automatic Music Transcription (AMT) consists in automatically estimating the notes in an audio recording, through three attributes: onset time, duration and pitch. Probabilistic Latent Component Analysis (PLCA) has become very popular for this task. PLCA is a spectrogram factorization method, able to model a magnitude spectrogram as a linear combination of spectral vectors from a dictionary. Such methods use the Expectation-Maximization (EM) algorithm to estimate the parameters of the acoustic model. This algorithm presents well-known inherent defaults (local convergence, initialization dependency), making EM-based systems limited in their applications to AMT, particularly in regards to the mathematical form and number of priors. To overcome such limits, we propose in this paper to employ a different estimation framework based on Particle Filtering (PF), which consists in sampling the posterior distribution over larger parameter ranges. This framework proves to be more robust in parameter estimation, more flexible and unifying in the integration of prior knowledge in the system. Note-level transcription accuracies of 61.8 % and 59.5 % were achieved on evaluation sound datasets of two different instrument repertoires, including the classical piano (from MAPS dataset) and the marovany zither, and direct comparisons to previous PLCA-based approaches are provided. Steps for further development are also outlined.


An efficient particle-based method for maximum likelihood estimation in nonlinear state-space models

Data assimilation methods aim at estimating the state of a system by com...

Investigation on the use of Hidden-Markov Models in automatic transcription of music

Hidden Markov Models (HMMs) are a ubiquitous tool to model time series d...

Model uncertainty estimation using the expectation maximization algorithm and a particle flow filter

Model error covariances play a central role in the performance of data a...

Robust Parameter Estimation for the Lee-Carter Model: A Probabilistic Principal Component Approach

As a traditional and widely-adopted mortality rate projection technique,...

Efficient Learning of Harmonic Priors for Pitch Detection in Polyphonic Music

Automatic music transcription (AMT) aims to infer a latent symbolic repr...

Distributed Picard Iteration: Application to Distributed EM and Distributed PCA

In recent work, we proposed a distributed Picard iteration (DPI) that al...

Please sign up or login with your details

Forgot password? Click here to reset