Speech Dereverberation Based on Integrated Deep and Ensemble Learning Algorithm

01/12/2018
by Wei-Jen Lee, et al.

Reverberation, which is generally caused by sound reflections from walls, ceilings, and floors, can severely degrade the performance of acoustic applications. Because it arises from a complicated combination of attenuation and time-delay effects, reverberation is difficult to characterize, and effectively retrieving anechoic speech signals from reverberant ones remains a challenging task. In the present study, we propose a novel integrated deep and ensemble learning algorithm (IDEA) for speech dereverberation. The IDEA consists of offline and online phases. In the offline phase, we train multiple dereverberation models, each aiming to precisely dereverberate speech signals in a particular acoustic environment; a unified fusion function is then estimated to integrate the information from the multiple dereverberation models. In the online phase, an input utterance is first processed by each of the dereverberation models. The outputs of all models are then integrated by the fusion function to generate the final anechoic signal. We evaluated the IDEA on designed acoustic environments, including both matched and mismatched conditions between the training and testing data. Experimental results confirm that the proposed IDEA outperforms a single deep-neural-network-based dereverberation model with the same model architecture and training data.
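The two-phase data flow described above can be sketched in a few lines of Python. This is only an illustration under strong assumptions, not the authors' implementation: each environment-specific dereverberation model and the fusion function are replaced here by least-squares linear maps standing in for the deep networks, and the names `fit_linear`, `train_idea`, and `dereverberate` are hypothetical. Inputs are assumed to be frame-by-feature matrices (for example, spectral features) of reverberant and clean speech.

```python
import numpy as np

def fit_linear(X, Y):
    """Least-squares map W such that X @ W approximates Y.
    Assumption: a linear stand-in for a trained deep model."""
    W, *_ = np.linalg.lstsq(X, Y, rcond=None)
    return W

def train_idea(env_data):
    """Offline phase.
    env_data: list of (reverberant, clean) feature matrices,
    one pair per acoustic environment."""
    # Step 1: one dereverberation model per acoustic environment.
    models = [fit_linear(X, Y) for X, Y in env_data]
    # Step 2: estimate the fusion function on the concatenated
    # outputs of all models, using the pooled training data.
    X_all = np.vstack([X for X, _ in env_data])
    Y_all = np.vstack([Y for _, Y in env_data])
    stacked = np.hstack([X_all @ W for W in models])
    fusion = fit_linear(stacked, Y_all)
    return models, fusion

def dereverberate(X, models, fusion):
    """Online phase: run every model, then fuse their outputs."""
    stacked = np.hstack([X @ W for W in models])
    return stacked @ fusion
```

Calling `train_idea` on a list of per-environment training pairs returns the models and the fusion map, which `dereverberate` then applies to a new utterance. Because everything here is linear, the sketch shows only how information moves between the phases; it is not expected to reproduce the gains reported for the deep models.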

