The Cone of Silence: Speech Separation by Localization

10/12/2020
by Teerapat Jenrungrot, et al.

Given a multi-microphone recording of an unknown number of speakers talking concurrently, we simultaneously localize the sources and separate the individual speakers. At the core of our method is a deep network, operating in the waveform domain, which isolates sources within an angular region θ ± w/2, given an angle of interest θ and an angular window size w. By exponentially decreasing w, we can perform a binary search to localize and separate all sources in logarithmic time. Our algorithm allows for an arbitrary number of potentially moving speakers at test time, including more speakers than seen during training. Experiments demonstrate state-of-the-art performance for both source separation and source localization, particularly under high levels of background noise.
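
The binary search over angular regions can be illustrated with a minimal sketch. It is not the authors' implementation: the separation network is replaced by a hypothetical toy function `energy(theta, w)` that simply counts how many known source angles fall inside θ ± w/2, and the starting layout (four quadrants with w = 90°), the 2° stopping window, and the detection threshold are all assumptions chosen for illustration.

```python
import math


def make_toy_energy(source_angles):
    """Hypothetical stand-in for the separation network (an assumption).

    Returns a function energy(theta, w) that counts the sources lying in the
    half-open angular region [theta - w/2, theta + w/2). A real system would
    instead run the network and measure the energy of its output waveform.
    """
    def energy(theta, w):
        hits = 0
        for s in source_angles:
            # Signed angular difference wrapped into [-pi, pi).
            d = (s - theta + math.pi) % (2 * math.pi) - math.pi
            if -w / 2 <= d < w / 2:
                hits += 1
        return float(hits)
    return energy


def localize(energy, min_window=math.radians(2.0), threshold=0.5):
    """Coarse-to-fine search: keep active regions, split them, halve w."""
    w = math.pi / 2  # assumed starting window: four 90-degree quadrants
    candidates = [-3 * math.pi / 4, -math.pi / 4, math.pi / 4, 3 * math.pi / 4]
    while True:
        # Keep only the regions in which the (toy) network finds a source.
        active = [t for t in candidates if energy(t, w) > threshold]
        if not active or w <= min_window:
            return active, w
        # Split every active region into two halves of width w/2.
        candidates = [t + sign * w / 4 for t in active for sign in (-1.0, 1.0)]
        w /= 2


if __name__ == "__main__":
    true_angles = [math.radians(40.0), math.radians(-100.0)]
    estimates, final_w = localize(make_toy_energy(true_angles))
    print([round(math.degrees(t), 2) for t in estimates], math.degrees(final_w))
```

Because empty regions are discarded at every level, the number of queries in this sketch grows with the number of sources times the number of halvings, rather than with the number of angular bins at the finest resolution. A real pipeline would also need to handle a source that leaks into two adjacent regions, which the toy half-open intervals avoid by construction.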

