Efficient Integration of Multi-channel Information for Speaker-independent Speech Separation

05/23/2020
by Yuichiro Koyama, et al.

Although deep-learning-based methods have markedly improved speech separation performance over the past few years, how to integrate multi-channel signals for speech separation remains an open question. We propose two methods, an early-fusion method and a late-fusion method, for integrating multi-channel information. Both are based on the time-domain audio separation network, which has proven effective in single-channel speech separation. We also propose channel-sequential-transfer learning, a transfer learning framework that uses the parameters trained for a lower-channel network as the initial values of a higher-channel network. For a fair comparison, we evaluated the proposed methods on a spatialized version of the wsj0-2mix dataset, which is open-sourced. The proposed methods outperform multi-channel deep clustering, and their performance improves in proportion to the number of microphones. The results also show that the late-fusion method consistently outperforms the single-channel method regardless of the angle difference between speakers.
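The transfer step described above, where parameters trained for a lower-channel network seed a higher-channel network, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the function name `init_from_lower_channel`, the layer names `encoder` and `separator`, the toy parameter layout, and in particular the choice to fill the parameters of the extra microphone with small random values, which the abstract does not specify.

```python
import copy
import random


def init_from_lower_channel(low_params, channel_layers, n_high, scale=0.01, seed=0):
    """Build initial parameters for an n_high-channel network.

    low_params     : dict mapping a layer name to its trained parameters.
                     For layers listed in channel_layers the value is a list
                     with one weight vector per microphone channel.
    channel_layers : names of the layers whose shape depends on the channel count.
    n_high         : number of channels of the target network.
    scale          : std. dev. for the new channels (an assumption, see lead-in).
    """
    rng = random.Random(seed)
    # Copy everything first, so channel-independent layers are reused as-is
    # and the trained lower-channel parameters are left untouched.
    high_params = copy.deepcopy(low_params)
    for name in channel_layers:
        per_channel = high_params[name]
        n_low = len(per_channel)
        if n_high < n_low:
            raise ValueError("target network must have at least as many channels")
        width = len(per_channel[0])
        # Existing channels keep their trained weights; only the added
        # channels receive fresh (hypothetical) small random values.
        for _ in range(n_high - n_low):
            per_channel.append([rng.gauss(0.0, scale) for _ in range(width)])
    return high_params


# Toy "trained" 2-channel parameters (illustrative values only).
two_ch = {
    "encoder": [[0.5, -0.2, 0.1], [0.3, 0.4, -0.6]],  # one vector per microphone
    "separator": [1.0, 2.0, 3.0],                     # channel-independent
}
three_ch = init_from_lower_channel(two_ch, ["encoder"], n_high=3)
```

Applied sequentially (2 channels to 3, 3 to 4, and so on), each stage would start training from the previous stage's result rather than from scratch, which is the idea the name of the framework suggests.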
