Deep Ad-hoc Beamforming

11/03/2018
by   Xiao-Lei Zhang, et al.
0

Deep learning based speech enhancement methods face two problems. First, their performance is strongly affected by the distance between the speech source and the microphones. Second, unlike conventional methods, deep-learning-based multichannel methods do not show significant performance improvement over their single-channel counterpart. To address the above problem, we propose deep ad-hoc beamforming---the first deep-learning-based multichannel speech enhancement method in an ad-hoc microphone array. It serves for scenarios where the microphones are placed randomly in a room and work collaboratively. It aims to pick up speech signals with equally good quality in a range where the array covers. Its core idea is to reweight the estimated speech signals when conducting beamforming, where the weights produced by a neural network are an estimation of the signal-to-noise ratios at the microphone array. We conducted an experiment in a scenario where the location of the speech source is far-field, random, and blind to the microphones. Results show that our method outperforms representative deep-learning-based speech enhancement methods by a large margin.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/01/2020

Deep Ad-hoc Beamforming Based on Speaker Extraction for Target-Dependent Speech Separation

Recently, the research on ad-hoc microphone arrays with deep learning ha...
research
02/15/2018

Deep Learning Based Speech Beamforming

Multi-channel speech enhancement with ad-hoc sensors has been a challeng...
research
11/07/2020

Enhancement by postfiltering for speech and audio coding in ad-hoc sensor networks

Enhancement algorithms for wireless acoustics sensor networks (WASNs) ar...
research
11/04/2022

Speech enhancement using ego-noise references with a microphone array embedded in an unmanned aerial vehicle

A method is proposed for performing speech enhancement using ego-noise r...
research
07/27/2021

Microphone Array Generalization for Multichannel Narrowband Deep Speech Enhancement

This paper addresses the problem of microphone array generalization for ...
research
01/24/2022

PickNet: Real-Time Channel Selection for Ad Hoc Microphone Arrays

This paper proposes PickNet, a neural network model for real-time channe...
research
10/22/2021

TADRN: Triple-Attentive Dual-Recurrent Network for Ad-hoc Array Multichannel Speech Enhancement

Deep neural networks (DNNs) have been successfully used for multichannel...

Please sign up or login with your details

Forgot password? Click here to reset