Unsupervised Interpretable Representation Learning for Singing Voice Separation

03/03/2020
by Stylianos I. Mimilakis, et al.

In this work, we present a method for learning interpretable music signal representations directly from waveform signals. Our method can be trained using unsupervised objectives and relies on a denoising auto-encoder model that uses a simple sinusoidal model as its decoding functions to reconstruct the singing voice. To demonstrate the benefits of our method, we apply the obtained representations to the task of informed singing voice separation via binary masking, and measure the separation quality by means of the scale-invariant signal-to-distortion ratio. Our findings suggest that our method is capable of learning meaningful representations for singing voice separation, while preserving conveniences of the short-time Fourier transform, such as non-negativity, smoothness, and reconstruction subject to time-frequency masking, that are desired in audio and music source separation.
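
The evaluation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the commonly used definition of the scale-invariant signal-to-distortion ratio (project the estimate onto the reference, then compare the energy of the projection with the energy of the residual), and it assumes that "informed" binary masking means the mask is computed from non-negative representations of the true voice and accompaniment. The function names `si_sdr`, `informed_binary_mask`, and `apply_mask` and the nested-list layout (frames by components) are hypothetical choices made for illustration.

```python
import math


def si_sdr(estimate, reference, eps=1e-12):
    """Scale-invariant signal-to-distortion ratio in dB (assumed standard definition)."""
    # Optimal scaling of the reference that best explains the estimate.
    dot = sum(e * r for e, r in zip(estimate, reference))
    ref_energy = sum(r * r for r in reference)
    scale = dot / (ref_energy + eps)
    # Scaled reference (target) and the remaining distortion.
    target = [scale * r for r in reference]
    target_energy = sum(t * t for t in target)
    noise_energy = sum((e - t) ** 2 for e, t in zip(estimate, target))
    return 10.0 * math.log10((target_energy + eps) / (noise_energy + eps))


def informed_binary_mask(voice_rep, accomp_rep):
    """Mask is 1.0 where the voice representation exceeds the accompaniment, else 0.0.

    Both inputs are assumed to be non-negative and of identical shape.
    """
    return [
        [1.0 if v > a else 0.0 for v, a in zip(v_row, a_row)]
        for v_row, a_row in zip(voice_rep, accomp_rep)
    ]


def apply_mask(mask, mixture_rep):
    """Element-wise product of the mask with the mixture representation."""
    return [
        [m * x for m, x in zip(m_row, x_row)]
        for m_row, x_row in zip(mask, mixture_rep)
    ]


# Toy example: two frames, two components per frame.
voice = [[0.9, 0.1], [0.2, 0.8]]
accompaniment = [[0.1, 0.5], [0.3, 0.2]]
mixture = [[1.0, 0.6], [0.5, 1.0]]

mask = informed_binary_mask(voice, accompaniment)
masked_mixture = apply_mask(mask, mixture)
```

In a full pipeline, the masked representation would be passed to the decoder to resynthesize a waveform, and `si_sdr` would then compare that waveform with the true singing voice. Because the metric rescales the reference, multiplying the estimate by a constant does not change the score.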

research
07/06/2020

Revisiting Representation Learning for Singing Voice Separation with Sinkhorn Distances

In this work we present a method for unsupervised learning of audio repr...
research
11/03/2020

Complex ratio masking for singing voice separation

Music source separation is important for applications such as karaoke an...
research
09/02/2017

A Recurrent Encoder-Decoder Approach with Skip-filtering Connections for Monaural Singing Voice Separation

The objective of deep learning methods based on encoder-decoder architec...
research
11/04/2017

Monaural Singing Voice Separation with Skip-Filtering Connections and Recurrent Inference of Time-Frequency Mask

Singing voice separation based on deep learning relies on the usage of t...
research
08/11/2020

Exploring Aligned Lyrics-Informed Singing Voice Separation

In this paper, we propose a method of utilizing aligned lyrics as additi...
research
01/09/2018

Informed Group-Sparse Representation for Singing Voice Separation

Singing voice separation attempts to separate the vocal and instrumental...
research
05/07/2018

A Data-Driven Approach to Smooth Pitch Correction for Singing Voice in Pop Music

In this paper, we present a machine-learning approach to pitch correctio...
