Adversarial Feature-Mapping for Speech Enhancement

09/06/2018
by Zhong Meng, et al.

Feature-mapping with deep neural networks is commonly used for single-channel speech enhancement: a feature-mapping network directly transforms the noisy features into the corresponding enhanced ones and is trained to minimize the mean squared error between the enhanced and clean features. In this paper, we propose an adversarial feature-mapping (AFM) method for speech enhancement, which advances the feature-mapping approach with adversarial learning. An additional discriminator network is introduced to distinguish the enhanced features from the real clean ones. The two networks are jointly optimized to minimize the feature-mapping loss and simultaneously mini-maximize the discrimination loss. Through this adversarial multi-task training, the distribution of the enhanced features is pushed further towards that of the clean features. To achieve better performance on the automatic speech recognition (ASR) task, we further propose senone-aware (SA) AFM, in which an acoustic model network is jointly trained with the feature-mapping and discriminator networks to optimize the senone classification loss in addition to the AFM losses. Evaluated on the CHiME-3 dataset, the proposed AFM achieves relative word error rate (WER) improvements over both the real noisy data (16.95%) and the feature-mapping baseline, and the SA-AFM achieves a 9.85% relative WER improvement over the multi-conditional acoustic model.
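The interplay of the two AFM losses described in the abstract can be illustrated with a small numeric sketch. This is not the authors' implementation: the function names, the binary cross-entropy form of the discrimination loss, and the trade-off weight `lam` are all assumptions made for illustration, and the discriminator outputs are passed in as plain probabilities rather than computed by a network.

```python
import math

def feature_mapping_loss(enhanced, clean):
    """Mean squared error between enhanced and clean feature vectors."""
    return sum((e - c) ** 2 for e, c in zip(enhanced, clean)) / len(clean)

def discrimination_loss(p_clean, p_enhanced, eps=1e-12):
    """Binary cross-entropy for a discriminator (assumed form).

    p_clean:    discriminator's "is clean" probability for a real clean feature
    p_enhanced: discriminator's "is clean" probability for an enhanced feature
    The loss is low when the discriminator tells the two apart.
    """
    return -(math.log(p_clean + eps) + math.log(1.0 - p_enhanced + eps))

def afm_objectives(enhanced, clean, p_clean, p_enhanced, lam=1.0):
    """Per-network objectives, each to be minimized by its own network.

    lam is a hypothetical weight balancing the two losses.
    """
    l_fm = feature_mapping_loss(enhanced, clean)
    l_d = discrimination_loss(p_clean, p_enhanced)
    return {
        # The discriminator minimizes the discrimination loss.
        "discriminator": l_d,
        # The feature mapper minimizes the MSE while maximizing the
        # discrimination loss, hence the negative sign.
        "feature_mapper": l_fm - lam * l_d,
    }

objectives = afm_objectives([1.0, 2.0], [1.0, 4.0], p_clean=0.5, p_enhanced=0.5, lam=0.5)
print(objectives)
```

In this sketch, a discriminator that cannot separate enhanced from clean features (both probabilities near 0.5) yields a high discrimination loss, which is exactly what the feature-mapping network is rewarded for. A senone-aware variant would add a classification loss term to the feature mapper's objective in the same way.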

research
09/06/2018

Cycle-Consistent Speech Enhancement

Feature mapping using deep neural networks is an effective approach for ...
research
11/02/2021

CycleGAN with Dual Adversarial Loss for Bone-Conducted Speech Enhancement

Compared with air-conducted speech, bone-conducted speech has the unique...
research
11/06/2018

Unpaired Speech Enhancement by Acoustic and Adversarial Supervision for Speech Recognition

Many speech enhancement methods try to learn the relationship between no...
research
05/17/2020

Single Channel Far Field Feature Enhancement For Speaker Verification In The Wild

We investigated an enhancement and a domain adaptation approach to make ...
research
09/25/2018

An Exploration of Mimic Architectures for Residual Network Based Spectral Mapping

Spectral mapping uses a deep neural network (DNN) to map directly from n...
research
03/25/2022

Speech-enhanced and Noise-aware Networks for Robust Speech Recognition

Compensation for channel mismatch and noise interference is essential fo...
research
11/11/2020

Deep Time Delay Neural Network for Speech Enhancement with Full Data Learning

Recurrent neural networks (RNNs) have shown significant improvements in ...
