CycleGAN with Dual Adversarial Loss for Bone-Conducted Speech Enhancement

11/02/2021
by   Qing Pan, et al.
0

Compared with air-conducted speech, bone-conducted speech has the unique advantage of shielding background noise. Enhancement of bone-conducted speech helps to improve its quality and intelligibility. In this paper, a novel CycleGAN with dual adversarial loss (CycleGAN-DAL) is proposed for bone-conducted speech enhancement. The proposed method uses an adversarial loss and a cycle-consistent loss simultaneously to learn forward and cyclic mapping, in which the adversarial loss is replaced with the classification adversarial loss and the defect adversarial loss to consolidate the forward mapping. Compared with conventional baseline methods, it can learn feature mapping between bone-conducted speech and target speech without additional air-conducted speech assistance. Moreover, the proposed method also avoids the oversmooth problem which is occurred commonly in conventional statistical based models. Experimental results show that the proposed method outperforms baseline methods such as CycleGAN, GMM, and BLSTM. Keywords: Bone-conducted speech enhancement, dual adversarial loss, Parallel CycleGAN, high frequency speech reconstruction

READ FULL TEXT
research
09/06/2018

Adversarial Feature-Mapping for Speech Enhancement

Feature-mapping with deep neural networks is commonly used for single-ch...
research
10/07/2019

Impulsive Noise Detection for Intelligibility and Quality Improvement of Speech Enhancement Methods Applied in Time-Domain

This letter introduces a novel speech enhancement method in the Hilbert-...
research
09/15/2022

MVNet: Memory Assistance and Vocal Reinforcement Network for Speech Enhancement

Speech enhancement improves speech quality and promotes the performance ...
research
02/24/2022

Phase Continuity: Learning Derivatives of Phase Spectrum for Speech Enhancement

Modern neural speech enhancement models usually include various forms of...
research
09/06/2018

Cycle-Consistent Speech Enhancement

Feature mapping using deep neural networks is an effective approach for ...
research
02/10/2022

Single-channel speech enhancement by using psychoacoustical model inspired fusion framework

When the parameters of Bayesian Short-time Spectral Amplitude (STSA) est...
research
08/21/2019

Coarse-to-fine Optimization for Speech Enhancement

In this paper, we propose the coarse-to-fine optimization for the task o...

Please sign up or login with your details

Forgot password? Click here to reset