Improving Source Separation via Multi-Speaker Representations

08/29/2017
by Jeroen Zegers, et al.

Recent developments in deep learning have made progress on the cocktail party problem. Initial results are promising and leave room for further research in the domain. One technique that has not yet been explored in neural network approaches to this task is speaker adaptation. Intuitively, information about the speakers being separated seems fundamentally important for speaker separation. Retrieving this information is challenging, however, because the speaker identities are not known a priori and multiple speakers are active simultaneously. This creates a chicken-and-egg problem: estimating speaker information calls for separated signals, while separation itself would benefit from speaker information. To tackle this, source signals and i-vectors are estimated alternately. We show that blind multi-speaker adaptation improves the results of the network and that (in our case) the network is not capable of adequately retrieving this useful speaker information itself.
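The alternating estimation described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: `separate` and `embed` are hypothetical stand-ins for the separation network and the i-vector extractor, the two-speaker setup is an assumption, and the "speaker vectors" here are simple averaged spectra, not real i-vectors. The sketch only shows the control flow: separate without speaker information, estimate speaker vectors from the result, then separate again using those vectors.

```python
import numpy as np

def embed(source_spec):
    """Hypothetical stand-in for an i-vector extractor: the L2-normalised
    time-average of a magnitude spectrogram (frames x bins)."""
    v = source_spec.mean(axis=0)
    return v / (np.linalg.norm(v) + 1e-8)

def separate(mixture_spec, speaker_vecs):
    """Hypothetical stand-in for the separation network: soft masks built
    from per-speaker vectors and applied to the mixture spectrogram."""
    if speaker_vecs is None:
        # First pass: no speaker information yet, so use an arbitrary split.
        n_bins = mixture_spec.shape[1]
        weights = np.stack([np.linspace(1.0, 0.0, n_bins),
                            np.linspace(0.0, 1.0, n_bins)])
    else:
        weights = np.stack(speaker_vecs)
    # Normalise so the masks sum to (approximately) one in every bin.
    masks = weights / (weights.sum(axis=0, keepdims=True) + 1e-8)
    return [mixture_spec * m for m in masks]

def alternate(mixture_spec, n_iters=3):
    """Alternate between source estimation and speaker-vector estimation."""
    speaker_vecs = None
    for _ in range(n_iters):
        sources = separate(mixture_spec, speaker_vecs)
        speaker_vecs = [embed(s) for s in sources]
    return sources, speaker_vecs

# Usage with a random non-negative "spectrogram" (10 frames, 8 bins).
rng = np.random.default_rng(0)
mix = rng.random((10, 8)) + 0.1
sources, vecs = alternate(mix, n_iters=3)
```

In a real system both stand-ins would be trained models; the point of the loop is only that each pass gives the separator speaker vectors estimated from the previous pass's output.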


research
08/24/2018

Multi-scenario deep learning for multi-speaker source separation

Research in deep learning for multi-speaker source separation has receiv...
research
12/05/2017

Multi-speaker Recognition in Cocktail Party Problem

This paper proposes an original statistical decision theory to accomplis...
research
05/26/2019

Auditory Separation of a Conversation from Background via Attentional Gating

We present a model for separating a set of voices out of a sound mixture...
research
05/12/2017

Monaural Audio Speaker Separation with Source Contrastive Estimation

We propose an algorithm to separate simultaneously speaking persons from...
research
01/25/2023

Separate And Diffuse: Using a Pretrained Diffusion Model for Improving Source Separation

The problem of speech separation, also known as the cocktail party probl...
research
12/12/2017

Classification vs. Regression in Supervised Learning for Single Channel Speaker Count Estimation

The task of estimating the maximum number of concurrent speakers from si...
research
09/12/2017

Addressee and Response Selection in Multi-Party Conversations with Speaker Interaction RNNs

In this paper, we study the problem of addressee and response selection ...
