Whispered-to-voiced Alaryngeal Speech Conversion with Generative Adversarial Networks

08/31/2018
by Santiago Pascual, et al.

Most voice restoration methods for patients suffering from aphonia produce either whispered or monotone speech. Apart from intelligibility issues, this type of speech lacks expressiveness and naturalness, owing to the absence of pitch (whispered speech) or its artificial generation (monotone speech). Existing techniques to restore prosodic information typically combine a vocoder, which parameterises the speech signal, with machine learning techniques that predict prosodic information. In contrast, this paper describes an end-to-end neural approach for estimating a fully-voiced speech waveform from whispered alaryngeal speech. By adapting our previous work on speech enhancement with generative adversarial networks, we develop a speaker-dependent model to perform whispered-to-voiced speech conversion. Preliminary qualitative results show that the model is effective at regenerating voiced speech and creates realistic pitch contours.

research
10/30/2018

Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech

This paper focuses on using voice conversion (VC) to improve the speech ...
research
03/28/2017

SEGAN: Speech Enhancement Generative Adversarial Network

Current speech enhancement techniques operate on the spectral domain and...
research
04/03/2018

Speech waveform synthesis from MFCC sequences with generative adversarial networks

This paper proposes a method for generating speech from filterbank mel f...
research
09/06/2021

Machine Learning: Challenges, Limitations, and Compatibility for Audio Restoration Processes

In this paper machine learning networks are explored for their use in re...
research
05/15/2020

Unsupervised Cross-Domain Speech-to-Speech Conversion with Time-Frequency Consistency

In recent years generative adversarial network (GAN) based models have b...
research
06/05/2022

Tagged-MRI Sequence to Audio Synthesis via Self Residual Attention Guided Heterogeneous Translator

Understanding the underlying relationship between tongue and oropharynge...
