Deep Karaoke: Extracting Vocals from Musical Mixtures Using a Convolutional Deep Neural Network

04/17/2015
by   Andrew J. R. Simpson, et al.
0

Identification and extraction of singing voice from within musical mixtures is a key challenge in source separation and machine audition. Recently, deep neural networks (DNN) have been used to estimate 'ideal' binary masks for carefully controlled cocktail party speech separation problems. However, it is not yet known whether these methods are capable of generalizing to the discrimination of voice and non-voice in the context of musical mixtures. Here, we trained a convolutional DNN (of around a billion parameters) to provide probabilistic estimates of the ideal binary mask for separation of vocal sounds from real-world musical mixtures. We contrast our DNN results with more traditional linear methods. Our approach may be useful for automatic removal of vocal sounds from musical mixtures for 'karaoke' type applications.

READ FULL TEXT
research
03/24/2015

Probabilistic Binary-Mask Cocktail-Party Source Separation in a Convolutional Deep Neural Network

Separation of competing speech is a key challenge in signal processing a...
research
12/02/2019

Investigating Deep Neural Transformations for Spectrogram-based Musical Source Separation

Musical Source Separation (MSS) is a signal processing task that tries t...
research
04/12/2015

Deep Transform: Cocktail Party Source Separation via Complex Convolution in a Deep Neural Network

Convolutional deep neural networks (DNN) are state of the art in many en...
research
03/20/2015

Deep Transform: Cocktail Party Source Separation via Probabilistic Re-Synthesis

In cocktail party listening scenarios, the human brain is able to separa...
research
03/18/2019

A Vocoder Based Method For Singing Voice Extraction

This paper presents a novel method for extracting the vocal track from a...
research
02/12/2020

Content Based Singing Voice Extraction From a Musical Mixture

We present a deep learning based methodology for extracting the singing ...
research
04/28/2023

Musical Voice Separation as Link Prediction: Modeling a Musical Perception Task as a Multi-Trajectory Tracking Problem

This paper targets the perceptual task of separating the different inter...

Please sign up or login with your details

Forgot password? Click here to reset