Class-conditional embeddings for music source separation

11/07/2018
by   Prem Seetharaman, et al.
0

Isolating individual instruments in a musical mixture has a myriad of potential applications, and seems imminently achievable given the levels of performance reached by recent deep learning methods. While most musical source separation techniques learn an independent model for each instrument, we propose using a common embedding space for the time-frequency bins of all instruments in a mixture inspired by deep clustering and deep attractor networks. Additionally, an auxiliary network is used to generate parameters of a Gaussian mixture model (GMM) where the posterior distribution over GMM components in the embedding space can be used to create a mask that separates individual sources from a mixture. In addition to outperforming a mask-inference baseline on the MUSDB-18 dataset, our embedding space is easily interpretable and can be used for query-based separation.

READ FULL TEXT

page 1

page 4

research
10/19/2020

Fast accuracy estimation of deep learning based multi-class musical source separation

Music source separation represents the task of extracting all the instru...
research
12/09/2022

Hyperbolic Audio Source Separation

We introduce a framework for audio source separation using embeddings on...
research
09/29/2020

Bespoke Neural Networks for Score-Informed Source Separation

In this paper, we introduce a simple method that can separate arbitrary ...
research
08/29/2019

Deep Bayesian Unsupervised Source Separation Based on a Complex Gaussian Mixture Model

This paper presents an unsupervised method that trains neural source sep...
research
11/18/2016

Deep Clustering and Conventional Networks for Music Separation: Stronger Together

Deep clustering is the first method to handle general audio separation s...
research
03/18/2019

A Vocoder Based Method For Singing Voice Extraction

This paper presents a novel method for extracting the vocal track from a...
research
03/23/2021

Learned complex masks for multi-instrument source separation

Music source separation in the time-frequency domain is commonly achieve...

Please sign up or login with your details

Forgot password? Click here to reset