Optimal spectral transportation with application to music transcription

09/30/2016
by   Rémi Flamary, et al.
0

Many spectral unmixing methods rely on the non-negative decomposition of spectral data onto a dictionary of spectral templates. In particular, state-of-the-art music transcription systems decompose the spectrogram of the input signal onto a dictionary of representative note spectra. The typical measures of fit used to quantify the adequacy of the decomposition compare the data and template entries frequency-wise. As such, small displacements of energy from a frequency bin to another as well as variations of timber can disproportionally harm the fit. We address these issues by means of optimal transportation and propose a new measure of fit that treats the frequency distributions of energy holistically as opposed to frequency-wise. Building on the harmonic nature of sound, the new measure is invariant to shifts of energy to harmonically-related frequencies, as well as to small and local displacements of energy. Equipped with this new measure of fit, the dictionary of note templates can be considerably simplified to a set of Dirac vectors located at the target fundamental frequencies (musical pitch values). This in turns gives ground to a very fast and simple decomposition algorithm that achieves state-of-the-art performance on real musical data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/10/2021

Tonal Frequencies, Consonance, Dissonance: A Math-Bio Intersection

To date, calculating the frequencies of musical notes requires one to kn...
research
02/11/2020

Periodicity Pitch Detection in Complex Harmonies on EEG Timeline Data

An acoustic stimulus, e.g., a musical harmony, is transformed in a highl...
research
06/17/2014

Automatic Fado Music Classification

In late 2011, Fado was elevated to the oral and intangible heritage of h...
research
08/26/2022

Mel Spectrogram Inversion with Stable Pitch

Vocoders are models capable of transforming a low-dimensional spectral r...
research
08/23/2021

Differential Music: Automated Music Generation Using LSTM Networks with Representation Based on Melodic and Harmonic Intervals

This paper presents a generative AI model for automated music compositio...
research
06/01/2018

Musical Instrument Separation on Shift-Invariant Spectrograms via Stochastic Dictionary Learning

We propose a method for the blind separation of audio signals from music...
research
09/19/2018

Switching divergences for spectral learning in blind speech dereverberation

When recorded in an enclosed room, a sound signal will most certainly ge...

Please sign up or login with your details

Forgot password? Click here to reset