LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example

10/11/2021
by   Hieu-Thi Luong, et al.
1

Emotional and controllable speech synthesis is a topic that has received much attention. However, most studies focused on improving the expressiveness and controllability in the context of linguistic content, even though natural verbal human communication is inseparable from spontaneous non-speech expressions such as laughter, crying, or grunting. We propose a model called LaughNet for synthesizing laughter by using waveform silhouettes as inputs. The motivation is not simply synthesizing new laughter utterances but testing a novel synthesis-control paradigm that uses an abstract representation of the waveform. We conducted basic listening test experiments, and the results showed that LaughNet can synthesize laughter utterances with moderate quality and retain the characteristics of the training example. More importantly, the generated waveforms have shapes similar to the input silhouettes. For future work, we will test the same method on other types of human nonverbal expressions and integrate it into more elaborated synthesis systems.

READ FULL TEXT
research
06/12/2021

Continuous Wavelet Vocoder-based Decomposition of Parametric Speech Waveform Synthesis

To date, various speech technology systems have adopted the vocoder appr...
research
11/15/2018

Comprehensive evaluation of statistical speech waveform synthesis

Statistical TTS systems that directly predict the speech waveform have r...
research
09/12/2023

CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram

In this work, we present CleanUNet 2, a speech denoising model that comb...
research
06/25/2018

EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System

We present EMPHASIS, an emotional phoneme-based acoustic model for speec...
research
08/22/2023

Evaluation of the Speech Resynthesis Capabilities of the VoicePrivacy Challenge Baseline B1

Speaker anonymization systems continue to improve their ability to obfus...
research
08/30/2018

Contribution of Glottal Waveform in Speech Emotion: A Comparative Pairwise Investigation

In this work, we investigated the contribution of the glottal waveform i...
research
09/19/2019

WEnets: A Convolutional Framework for Evaluating Audio Waveforms

We describe a new convolutional framework for waveform evaluation, WEnet...

Please sign up or login with your details

Forgot password? Click here to reset