Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion

11/27/2018
by Wen-Chin Huang, et al.

This paper presents a refinement framework for WaveNet vocoders in variational autoencoder (VAE) based voice conversion (VC), which reduces the quality distortion caused by the mismatch between training and testing data. Conventional WaveNet vocoders are trained on natural acoustic features but are conditioned on converted features in the conversion stage of VC, and this mismatch often causes significant degradation in quality and similarity. In this work, we take advantage of the particular structure of VAEs to refine WaveNet vocoders with the self-reconstructed features generated by the VAE, which have characteristics similar to those of the converted features while having the same data length as the target training data. In other words, our proposed method does not require any alignment. Objective and subjective experimental results demonstrate the effectiveness of our proposed framework.
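The alignment-free property described in the abstract can be illustrated with a small sketch. Everything below is a toy stand-in and an assumption made for illustration: the random linear `encode`/`decode` maps, the feature and latent sizes, the hop size `HOP`, and the random "utterances" are hypothetical and are not the networks or data used in the paper. The sketch only shows why self-reconstructed target features pair frame by frame with the target waveform, whereas converted features inherit the length of the source utterance.

```python
import numpy as np

rng = np.random.default_rng(0)
# Hypothetical sizes chosen for illustration only.
FEAT_DIM, LATENT_DIM, N_SPEAKERS, HOP = 8, 4, 2, 80

# Toy stand-ins for a trained VAE: random linear maps, not the paper's model.
W_enc = rng.standard_normal((FEAT_DIM, LATENT_DIM))
W_dec = rng.standard_normal((LATENT_DIM + N_SPEAKERS, FEAT_DIM))

def encode(features):
    """Map (frames, FEAT_DIM) features to (frames, LATENT_DIM) latent codes."""
    return features @ W_enc

def decode(latents, speaker_id):
    """Decode latents frame by frame, conditioned on a one-hot speaker code."""
    onehot = np.zeros((latents.shape[0], N_SPEAKERS))
    onehot[:, speaker_id] = 1.0
    return np.concatenate([latents, onehot], axis=1) @ W_dec

SRC, TGT = 0, 1
src_feats = rng.standard_normal((120, FEAT_DIM))  # source utterance, 120 frames
tgt_feats = rng.standard_normal((95, FEAT_DIM))   # target utterance, 95 frames
tgt_wave = rng.standard_normal(95 * HOP)          # target waveform, HOP samples per frame

# Conversion: source content decoded with the target speaker code.
converted = decode(encode(src_feats), TGT)
# Self-reconstruction: target features passed through the same encode/decode path.
reconstructed = decode(encode(tgt_feats), TGT)

# Frame i of `reconstructed` lines up with samples [i*HOP, (i+1)*HOP) of the
# target waveform, so vocoder refinement pairs need no time alignment.
refine_pairs = [
    (reconstructed[i], tgt_wave[i * HOP:(i + 1) * HOP])
    for i in range(reconstructed.shape[0])
]

print(converted.shape, reconstructed.shape, len(refine_pairs))
```

In this sketch, `converted` has as many frames as the source utterance and so cannot be paired directly with `tgt_wave`, while `reconstructed` has exactly as many frames as `tgt_feats`. A vocoder refined on `refine_pairs` would see conditioning features that have passed through the VAE decoder, as the converted features do at conversion time.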


research
07/24/2019

Non-Parallel Voice Conversion with Cyclic Variational Autoencoder

In this paper, we present a novel technique for a non-parallel voice con...
research
09/15/2019

Voice Conversion Using Cycle-Consistent Variational Autoencoder

One of the most critical obstacles in voice conversion is the requiremen...
research
05/02/2019

Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion

In this work, we investigate the effectiveness of two techniques for imp...
research
05/28/2019

Automatic Quality Control and Enhancement for Voice-Based Remote Parkinson's Disease Detection

The performance of voice-based Parkinson's disease (PD) detection system...
research
05/18/2020

A Cyclical Post-filtering Approach to Mismatch Refinement of Neural Vocoder for Text-to-speech Systems

Recently, the effectiveness of text-to-speech (TTS) systems combined wit...
research
06/30/2022

An Evaluation of Three-Stage Voice Conversion Framework for Noisy and Reverberant Conditions

This paper presents a new voice conversion (VC) framework capable of dea...
