Comprehensive evaluation of statistical speech waveform synthesis

11/15/2018
by   Thomas Merritt, et al.
0

Statistical TTS systems that directly predict the speech waveform have recently reported improvements in synthesis quality. This investigation evaluates Amazon's statistical speech waveform synthesis (SSWS) system. An in-depth evaluation of SSWS is conducted across a number of domains to better understand the consistency in quality. The results of this evaluation are validated by repeating the procedure on a separate group of testers. Finally, an analysis of the nature of speech errors of SSWS compared to hybrid unit selection synthesis is conducted to identify the strengths and weaknesses of SSWS. Having a deeper insight into SSWS allows us to better define the focus of future work to improve this new technology.

READ FULL TEXT

page 3

page 4

research
06/12/2021

Continuous Wavelet Vocoder-based Decomposition of Parametric Speech Waveform Synthesis

To date, various speech technology systems have adopted the vocoder appr...
research
09/12/2023

CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram

In this work, we present CleanUNet 2, a speech denoising model that comb...
research
10/11/2021

LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example

Emotional and controllable speech synthesis is a topic that has received...
research
11/21/2022

Embedding a Differentiable Mel-cepstral Synthesis Filter to a Neural Speech Synthesis System

This paper integrates a classic mel-cepstral synthesis filter into a mod...
research
09/19/2019

WEnets: A Convolutional Framework for Evaluating Audio Waveforms

We describe a new convolutional framework for waveform evaluation, WEnet...
research
11/29/2018

LP-WaveNet: Linear Prediction-based WaveNet Speech Synthesis

We propose a linear prediction (LP)-based waveform generation method via...
research
08/20/2018

Fast Spectrogram Inversion using Multi-head Convolutional Neural Networks

We propose the multi-head convolutional neural network (MCNN) architectu...

Please sign up or login with your details

Forgot password? Click here to reset