Multi-channel Narrow-Band Deep Speech Separation with Full-band Permutation Invariant Training

10/12/2021
by   Changsheng Quan, et al.

This paper addresses the problem of multi-channel multi-speaker speech separation based on deep learning techniques. In the short-time Fourier transform (STFT) domain, we propose an end-to-end narrow-band network that takes the multi-channel mixture signals of a single frequency directly as input and outputs the separated signals of that frequency. Within a narrow band, the spatial information (the inter-channel differences) discriminates well between speakers at different positions. This information is heavily used in many narrow-band speech separation methods, such as beamforming and clustering of spatial vectors. The proposed network is trained to learn a rule that automatically exploits this information to perform speech separation. Such a rule should be valid for any frequency, hence the network is shared by all frequencies. In addition, a full-band permutation invariant training criterion is proposed to solve the frequency permutation problem encountered by most narrow-band methods. Experiments show that, by focusing on learning the narrow-band information, the proposed method outperforms both the oracle beamforming method and the state-of-the-art deep learning based method.
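The two ideas in the abstract, per-frequency processing with a shared network and a single speaker permutation chosen over the full band, can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, the tensor layouts, the stacking of real and imaginary parts as input features, and the mean-squared-error loss are all assumptions made for illustration, since the abstract does not specify them.

```python
import itertools
import numpy as np

def to_narrow_band_inputs(stft):
    """Split a multi-channel STFT into one input sequence per frequency.

    stft: complex array, shape (channels, freqs, frames).
    Returns a real array, shape (freqs, frames, 2 * channels), so that the
    same (shared) network can be applied to every frequency independently.
    Stacking real and imaginary parts is an assumption of this sketch.
    """
    x = np.transpose(stft, (1, 2, 0))  # (freqs, frames, channels)
    return np.concatenate([x.real, x.imag], axis=-1)

def full_band_pit_loss(est, ref):
    """Permutation invariant loss with ONE speaker permutation for all frequencies.

    est, ref: complex arrays, shape (speakers, freqs, frames).
    Returns (loss, perm), where est[list(perm)] is the ordering that best matches ref.
    MSE is a stand-in for whatever signal-level loss is actually used.
    """
    best_loss, best_perm = np.inf, None
    for perm in itertools.permutations(range(est.shape[0])):
        # The error is averaged over every frequency and frame, so a
        # permutation cannot be chosen independently per frequency.
        loss = np.mean(np.abs(est[list(perm)] - ref) ** 2)
        if loss < best_loss:
            best_loss, best_perm = loss, perm
    return best_loss, best_perm

def per_frequency_pit_perms(est, ref):
    """For contrast: choose the best permutation separately at each frequency."""
    return [full_band_pit_loss(est[:, f:f + 1], ref[:, f:f + 1])[1]
            for f in range(est.shape[1])]

# Toy data: 2 speakers, 4 frequencies, 5 frames.
rng = np.random.default_rng(0)
ref = rng.standard_normal((2, 4, 5)) + 1j * rng.standard_normal((2, 4, 5))

# Perfect separation, but the speakers are swapped at frequency 2 only.
est = ref.copy()
est[:, 2] = ref[::-1, 2]

perms = per_frequency_pit_perms(est, ref)   # orderings disagree across frequencies
loss, perm = full_band_pit_loss(est, ref)   # non-zero: the inconsistency is penalized
```

In this toy case the per-frequency criterion is satisfied with zero error even though the output order differs at one frequency, which is the frequency permutation problem. The full-band criterion yields a non-zero loss for the same estimate, so training with it pushes the shared network toward a consistent speaker order across frequencies.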


research
04/09/2022

Multichannel Speech Separation with Narrow-band Conformer

This work proposes a multichannel speech separation method with narrow-b...
research
11/22/2022

Deep Neural Mel-Subband Beamformer for In-car Speech Separation

While current deep learning (DL)-based beamforming techniques have been ...
research
02/02/2019

Is CQT more suitable for monaural speech separation than STFT? an empirical study

Short-time Fourier transform (STFT) is used as the front end of many pop...
research
12/05/2022

NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer

This work proposes a multichannel narrow-band speech separation network....
research
03/13/2023

Multi-Microphone Speaker Separation by Spatial Regions

We consider the task of region-based source separation of reverberant mu...
research
10/30/2019

End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation

An important problem in ad-hoc microphone speech separation is how to gu...
research
04/05/2022

On the Relevance of Bandwidth Extension for Speaker Verification

In this paper, we consider the effect of a bandwidth extension of narrow...
