Multichannel Speech Separation with Narrow-band Conformer

04/09/2022
by   Changsheng Quan, et al.
0

This work proposes a multichannel speech separation method with narrow-band Conformer (named NBC). The network is trained to learn to automatically exploit narrow-band speech separation information, such as spatial vector clustering of multiple speakers. Specifically, in the short-time Fourier transform (STFT) domain, the network processes each frequency independently, and is shared by all frequencies. For one frequency, the network inputs the STFT coefficients of multichannel mixture signals, and predicts the STFT coefficients of separated speech signals. Clustering of spatial vectors shares a similar principle with the self-attention mechanism in the sense of computing the similarity of vectors and then aggregating similar vectors. Therefore, Conformer would be especially suitable for the present problem. Experiments show that the proposed narrow-band Conformer achieves better speech separation performance than other state-of-the-art methods by a large margin.

READ FULL TEXT
research
12/05/2022

NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer

This work proposes a multichannel narrow-band speech separation network....
research
10/12/2021

Multi-channel Narrow-Band Deep Speech Separation with Full-band Permutation Invariant Training

This paper addresses the problem of multi-channel multi-speech separatio...
research
07/31/2023

SpatialNet: Extensively Learning Spatial Information for Multichannel Joint Speech Separation, Denoising and Dereverberation

This work proposes a neural network to extensively exploit spatial infor...
research
04/05/2022

On the Relevance of Bandwidth Extension for Speaker Verification

In this paper, we consider the effect of a bandwidth extension of narrow...
research
11/22/2022

Deep Neural Mel-Subband Beamformer for In-car Speech Separation

While current deep learning (DL)-based beamforming techniques have been ...
research
04/10/2019

Expectation-Maximization for Speech Source Separation Using Convolutive Transfer Function

This paper addresses the problem of under-determinded speech source sepa...
research
10/10/2021

Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent Domain

The crux of single-channel speech separation is how to encode the mixtur...

Please sign up or login with your details

Forgot password? Click here to reset