Model selection for deep audio source separation via clustering analysis

10/23/2019
by   Alisa Liu, et al.
0

Audio source separation is the process of separating a mixture (e.g. a pop band recording) into isolated sounds from individual sources (e.g. just the lead vocals). Deep learning models are the state-of-the-art in source separation, given that the mixture to be separated is similar to the mixtures the deep model was trained on. This requires the end user to know enough about each model's training to select the correct model for a given audio mixture. In this work, we automate selection of the appropriate model for an audio mixture. We present a confidence measure that does not require ground truth to estimate separation quality, given a deep model and audio mixture. We use this confidence measure to automatically select the model output with the best predicted separation quality. We compare our confidence-based ensemble approach to using individual models with no selection, to an oracle that always selects the best model and to a random model selector. Results show our confidence-based ensemble significantly outperforms the random ensemble over general mixtures and approaches oracle performance for music mixtures.

READ FULL TEXT
research
01/24/2022

Unsupervised Audio Source Separation Using Differentiable Parametric Source Models

Supervised deep learning approaches to underdetermined audio source sepa...
research
07/28/2021

Neural Remixer: Learning to Remix Music with Interactive Control

The task of manipulating the level and/or effects of individual instrume...
research
11/11/2017

Unsupervised Audio Source Separation via Spectrum Energy Preserved Wasserstein Learning

Separating audio mixtures into individual tracks has been a long standin...
research
07/21/2021

Controlling the Remixing of Separated Dialogue with a Non-Intrusive Quality Estimate

Remixing separated audio sources trades off interferer attenuation again...
research
07/27/2023

Complete and separate: Conditional separation with missing target source attribute completion

Recent approaches in source separation leverage semantic information abo...
research
10/23/2019

Bootstrapping deep music separation from primitive auditory grouping principles

Separating an audio scene such as a cocktail party into constituent, mea...
research
11/06/2019

Finding Strength in Weakness: Learning to Separate Sounds with Weak Supervision

While there has been much recent progress using deep learning techniques...

Please sign up or login with your details

Forgot password? Click here to reset