Bootstrapping deep music separation from primitive auditory grouping principles

10/23/2019
by   Prem Seetharaman, et al.
0

Separating an audio scene such as a cocktail party into constituent, meaningful components is a core task in computer audition. Deep networks are the state-of-the-art approach. They are trained on synthetic mixtures of audio made from isolated sound source recordings so that ground truth for the separation is known. However, the vast majority of available audio is not isolated. The brain uses primitive cues that are independent of the characteristics of any particular sound source to perform an initial segmentation of the audio scene. We present a method for bootstrapping a deep model for music source separation without ground truth by using multiple primitive cues. We apply our method to train a network on a large set of unlabeled music recordings from YouTube to separate vocals from accompaniment without the need for ground truth isolated sources or artificial training mixtures.

READ FULL TEXT
research
11/06/2018

Bootstrapping single-channel source separation via unsupervised spatial clustering on stereo mixtures

Separating an audio scene into isolated sources is a fundamental problem...
research
11/05/2018

Unsupervised Deep Clustering for Source Separation: Direct Learning from Mixtures using Spatial Information

We present a monophonic source separation system that is trained by only...
research
11/06/2019

Finding Strength in Weakness: Learning to Separate Sounds with Weak Supervision

While there has been much recent progress using deep learning techniques...
research
10/23/2019

Model selection for deep audio source separation via clustering analysis

Audio source separation is the process of separating a mixture (e.g. a p...
research
01/03/2021

Adversarial Unsupervised Domain Adaptation for Harmonic-Percussive Source Separation

This paper addresses the problem of domain adaptation for the task of mu...
research
12/15/2019

Breaking Speech Recognizers to Imagine Lyrics

We introduce a new method for generating text, and in particular song ly...
research
07/09/2021

Blind Source Separation in Polyphonic Music Recordings Using Deep Neural Networks Trained via Policy Gradients

We propose a method for the blind separation of sounds of musical instru...

Please sign up or login with your details

Forgot password? Click here to reset