High-quality Speech Synthesis Using Super-resolution Mel-Spectrogram

12/03/2019
by   Leyuan Sheng, et al.
0

In speech synthesis and speech enhancement systems, melspectrograms need to be precise in acoustic representations. However, the generated spectrograms are over-smooth, that could not produce high quality synthesized speech. Inspired by image-to-image translation, we address this problem by using a learning-based post filter combining Pix2PixHD and ResUnet to reconstruct the mel-spectrograms together with super-resolution. From the resulting super-resolution spectrogram networks, we can generate enhanced spectrograms to produce high quality synthesized speech. Our proposed model achieves improved mean opinion scores (MOS) of 3.71 and 4.01 over baseline results of 3.29 and 3.84, while using vocoder Griffin-Lim and WaveNet, respectively.

READ FULL TEXT

page 3

page 5

research
05/15/2023

Screentone-Aware Manga Super-Resolution Using DeepLearning

Manga, as a widely beloved form of entertainment around the world, have ...
research
05/18/2023

mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra

Speech super-resolution (SSR) aims to recover a high resolution (HR) spe...
research
10/12/2020

LASSR: Effective Super-Resolution Method for Plant Disease Diagnosis

The collection of high-resolution training data is crucial in building r...
research
09/28/2021

VoiceFixer: Toward General Speech Restoration with Neural Vocoder

Speech restoration aims to remove distortions in speech signals. Prior m...
research
08/26/2022

Laplacian Pyramid-like Autoencoder

In this paper, we develop the Laplacian pyramid-like autoencoder (LPAE) ...
research
03/16/2023

Mimic3D: Thriving 3D-Aware GANs via 3D-to-2D Imitation

Generating images with both photorealism and multiview 3D consistency is...
research
09/20/2021

DEM Super-Resolution with EfficientNetV2

Efficient climate change monitoring and modeling rely on high-quality ge...

Please sign up or login with your details

Forgot password? Click here to reset