Distributed Microphone Speech Enhancement based on Deep Learning

11/19/2019
by   Syu-Siang Wang, et al.
0

Speech-related applications deliver inferior performance in complex noise environments. Therefore, this study primarily addresses this problem by introducing speech-enhancement (SE) systems based on deep neural networks (DNNs) applied to a distributed microphone architecture. The first system constructs a DNN model for each microphone to enhance the recorded noisy speech signal, and the second system combines all the noisy recordings into a large feature structure that is then enhanced through a DNN model. As for the third system, a channel-dependent DNN is first used to enhance the corresponding noisy input, and all the channel-wise enhanced outputs are fed into a DNN fusion model to construct a nearly clean signal. All the three DNN SE systems are operated in the acoustic frequency domain of speech signals in a diffuse-noise field environment. Evaluation experiments were conducted on the Taiwan Mandarin Hearing in Noise Test (TMHINT) database, and the results indicate that all the three DNN-based SE systems provide the original noise-corrupted signals with improved speech quality and intelligibility, whereas the third system delivers the highest signal-to-noise ratio (SNR) improvement and optimal speech intelligibility.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

research
10/19/2021

Speech Enhancement-assisted Stargan Voice Conversion in Noisy Environments

Numerous voice conversion (VC) techniques have been proposed for the con...
research
12/02/2022

Injecting Spatial Information for Monaural Speech Enhancement via Knowledge Distillation

Monaural speech enhancement (SE) provides a versatile and cost-effective...
research
09/03/2023

Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement

Speech emotion recognition (SER) often experiences reduced performance d...
research
02/02/2018

Monaural Speech Enhancement using Deep Neural Networks by Maximizing a Short-Time Objective Intelligibility Measure

In this paper we propose a Deep Neural Network (DNN) based Speech Enhanc...
research
05/29/2017

DNN-based uncertainty estimation for weighted DNN-HMM ASR

In this paper, the uncertainty is defined as the mean square error betwe...
research
11/16/2022

A Two-Stage Deep Representation Learning-Based Speech Enhancement Method Using Variational Autoencoder and Adversarial Training

This paper focuses on leveraging deep representation learning (DRL) for ...
research
10/27/2022

Audio Signal Enhancement with Learning from Positive and Unlabelled Data

Supervised learning is a mainstream approach to audio signal enhancement...

Please sign up or login with your details

Forgot password? Click here to reset