Deep Transform: Cocktail Party Source Separation via Probabilistic Re-Synthesis

03/20/2015
by Andrew J. R. Simpson, et al.

In cocktail party listening scenarios, the human brain is able to separate competing speech signals. However, the signal processing the brain uses to do this is not well understood. Here, we trained two separate convolutive autoencoder deep neural networks (DNNs) to separate monaural and binaural mixtures of two concurrent speech streams. We then used these DNNs as convolutive deep transform (CDT) devices to perform probabilistic re-synthesis. The CDTs operated directly in the time domain. Our simulations demonstrate that very simple neural networks are capable of exploiting the monaural and binaural information available in a cocktail party listening scenario.
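The abstract does not spell out how probabilistic re-synthesis is carried out, so the following is only a minimal sketch of one plausible reading: a trained network is applied to overlapping time-domain windows of the mixture, and the overlapping predictions are averaged sample by sample. The function name `resynthesize`, the `transform` callable standing in for a trained CDT, and the `window` and `hop` values are all assumptions made for illustration, not details taken from the paper.

```python
import numpy as np


def resynthesize(mixture, transform, window=64, hop=16):
    """Average per-window predictions over all overlapping windows.

    `transform` is a hypothetical stand-in for a trained network: it maps
    one time-domain window to an estimate of the target source in that
    window. Samples not covered by any full window are left at zero.
    """
    mixture = np.asarray(mixture, dtype=float)
    acc = np.zeros_like(mixture)     # running sum of predictions
    counts = np.zeros_like(mixture)  # number of windows covering each sample

    for start in range(0, len(mixture) - window + 1, hop):
        segment = mixture[start:start + window]
        acc[start:start + window] += transform(segment)
        counts[start:start + window] += 1.0

    out = np.zeros_like(mixture)
    covered = counts > 0
    out[covered] = acc[covered] / counts[covered]
    return out


# Usage with a placeholder transform (identity) instead of a real network.
x = np.sin(np.linspace(0, 20, 256))
y = resynthesize(x, lambda s: s)
```

With an identity `transform`, the averaged output reproduces the input wherever windows cover it, which is a convenient sanity check before substituting a real model.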

research
03/24/2015

Probabilistic Binary-Mask Cocktail-Party Source Separation in a Convolutional Deep Neural Network

Separation of competing speech is a key challenge in signal processing a...
research
04/12/2015

Deep Transform: Cocktail Party Source Separation via Complex Convolution in a Deep Neural Network

Convolutional deep neural networks (DNN) are state of the art in many en...
research
04/17/2015

Deep Karaoke: Extracting Vocals from Musical Mixtures Using a Convolutional Deep Neural Network

Identification and extraction of singing voice from within musical mixtu...
research
03/19/2015

Deep Transform: Time-Domain Audio Error Correction via Probabilistic Re-Synthesis

In the process of recording, storage and transmission of time-domain aud...
research
02/01/2022

Differentiable Digital Signal Processing Mixture Model for Synthesis Parameter Extraction from Mixture of Harmonic Sounds

A differentiable digital signal processing (DDSP) autoencoder is a music...
research
12/17/2021

Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem

Deep learning based models have significantly improved the performance o...
research
09/04/2020

Towards Musically Meaningful Explanations Using Source Separation

Deep neural networks (DNNs) are successfully applied in a wide variety o...
