Efficient Independent Vector Extraction of Dominant Target Speech

08/01/2020
by   Lele Liao, et al.
0

The complete decomposition performed by blind source separation is computationally demanding and superfluous when only the speech of one specific target speaker is desired. In this paper, we propose a computationally efficient blind speech extraction method based on a proper modification of the commonly utilized independent vector analysis algorithm, under the mild assumption that the average power of signal of interest outweighs interfering speech sources. Considering that the minimum distortion principle cannot be implemented since the full demixing matrix is not available, we also design a one-unit scaling operation to solve the scaling ambiguity. Simulations validate the efficacy of the proposed method in extracting the dominant speech.

READ FULL TEXT
research
02/09/2021

Independent Vector Extraction for Joint Blind Source Separation and Dereverberation

We address a blind source separation (BSS) problem in a noisy reverberan...
research
12/10/2018

A Computationally Efficient and Practically Feasible Two Microphones Blind Speech Separation Method

Traditionally, Blind Speech Separation techniques are computationally ex...
research
10/25/2019

Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors

We propose a novel algorithm for adaptive blind audio source extraction....
research
08/04/2021

Blind and neural network-guided convolutional beamformer for joint denoising, dereverberation, and source separation

This paper proposes an approach for optimizing a Convolutional BeamForme...
research
10/19/2020

Attention-based scaling adaptation for target speech extraction

The target speech extraction has attracted widespread attention in recen...
research
11/05/2021

Blind Extraction of Target Speech Source Guided by Supervised Speaker Identification via X-vectors

This manuscript proposes a novel robust procedure for extraction of a sp...
research
01/19/2018

Epoch-Synchronous Overlap-Add (ESOLA) for Time- and Pitch-Scale Modification of Speech Signals

Time- and pitch-scale modifications of speech signals find important app...

Please sign up or login with your details

Forgot password? Click here to reset