Speech Enhancement-assisted Stargan Voice Conversion in Noisy Environments

10/19/2021
by   Yun-Ju Chan, et al.
0

Numerous voice conversion (VC) techniques have been proposed for the conversion of voices among different speakers. Although the decent quality of converted speech can be observed when VC is applied in a clean environment, the quality will drop sharply when the system is running under noisy conditions. In order to address this issue, we propose a novel enhancement-based StarGAN (E-StarGAN) VC system, which leverages a speech enhancement (SE) technique for signal pre-processing. SE systems are generally used to reduce noise components in noisy speech and to generate enhanced speech for downstream application tasks. Therefore, we investigated the effectiveness of E-StarGAN, which combines VC and SE, and demonstrated the robustness of the proposed approach in various noisy environments. The results of VC experiments conducted on a Mandarin dataset show that when combined with SE, the proposed E-StarGAN VC model is robust to unseen noises. In addition, the subjective listening test results show that the proposed E-StarGAN model can improve the sound quality of speech signals converted from noise-corrupted source utterances.

READ FULL TEXT
research
03/22/2022

Joint Noise Reduction and Listening Enhancement for Full-End Speech Enhancement

Speech enhancement (SE) methods mainly focus on recovering clean speech ...
research
08/21/2020

CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application

In this paper, we present a deep learning-based speech signal-processing...
research
11/19/2019

Distributed Microphone Speech Enhancement based on Deep Learning

Speech-related applications deliver inferior performance in complex nois...
research
12/22/2016

Robustness of Voice Conversion Techniques Under Mismatched Conditions

Most of the existing studies on voice conversion (VC) are conducted in a...
research
10/19/2021

Speech Enhancement Based on Cyclegan with Noise-informed Training

Speech enhancement (SE) approaches can be classified into supervised and...
research
01/29/2021

Speech Enhancement for Wake-Up-Word detection in Voice Assistants

Keyword spotting and in particular Wake-Up-Word (WUW) detection is a ver...
research
11/25/2022

Stereo Speech Enhancement Using Custom Mid-Side Signals and Monaural Processing

Speech Enhancement (SE) systems typically operate on monaural input and ...

Please sign up or login with your details

Forgot password? Click here to reset