Analysing Deep Learning-Spectral Envelope Prediction Methods for Singing Synthesis

03/04/2019
by Frederik Bous, et al.

We investigate several hyper-parameters of neural networks used to generate spectral envelopes for singing synthesis. We perform two perceptual tests: the first compares two models directly, and the second ranks models by mean opinion score. With these tests we show that, when learning to predict spectral envelopes, 2D convolutions are superior to the previously proposed 1D convolutions, and that predicting multiple frames in an iterated fashion during training is superior to injecting noise into the input data. An experimental comparison of learning to predict a probability distribution versus single samples was inconclusive. We propose a network architecture that incorporates the improvements we found useful, and we show in our experiments that this network produces better results than other state-of-the-art methods.
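To make the 1D versus 2D distinction concrete, the following is a minimal NumPy sketch, not the architecture from the paper. It assumes a spectral envelope sequence stored as a (frames, bins) array, and it assumes that the 1D variant treats frequency bins as input channels, which is one common arrangement but is not confirmed by the abstract. The function names and array sizes are hypothetical.

```python
import numpy as np

def conv1d_over_time(x, w):
    """Slide a kernel along time only; frequency bins act as input channels.

    x: (frames, bins) envelope sequence
    w: (kernel_t, bins, out_channels) weights
    returns: (frames - kernel_t + 1, out_channels)
    """
    kernel_t = w.shape[0]
    n_out = x.shape[0] - kernel_t + 1
    out = np.empty((n_out, w.shape[2]))
    for t in range(n_out):
        # Every frequency bin has its own weight, so nothing is shared across bins.
        out[t] = np.tensordot(x[t:t + kernel_t], w, axes=([0, 1], [0, 1]))
    return out

def conv2d_time_freq(x, w):
    """Slide one small kernel over both time and frequency.

    As in most deep learning libraries, this is a cross-correlation
    (the kernel is not flipped).

    x: (frames, bins)
    w: (kernel_t, kernel_f)
    returns: (frames - kernel_t + 1, bins - kernel_f + 1)
    """
    kt, kf = w.shape
    out = np.empty((x.shape[0] - kt + 1, x.shape[1] - kf + 1))
    for t in range(out.shape[0]):
        for f in range(out.shape[1]):
            out[t, f] = np.sum(x[t:t + kt, f:f + kf] * w)
    return out

rng = np.random.default_rng(0)
frames, bins = 8, 16                      # hypothetical sizes
x = rng.standard_normal((frames, bins))

w1 = rng.standard_normal((3, bins, 4))    # 3 * 16 * 4 = 192 weights
w2 = rng.standard_normal((3, 3))          # 9 weights, shared across all bins

y1 = conv1d_over_time(x, w1)
y2 = conv2d_time_freq(x, w2)

# Shifting the envelope by one bin along frequency shifts the 2D output
# by one bin as well (away from the wrapped-around edge).
y2_shifted = conv2d_time_freq(np.roll(x, 1, axis=1), w2)
shift_equivariant = np.allclose(y2_shifted[:, 1:], y2[:, :-1])
```

In this sketch the 2D kernel reuses the same few weights at every frequency position, so a pattern that moves along the frequency axis produces a correspondingly moved response. The 1D variant has no such weight sharing across bins. Whether this property explains the perceptual results reported above is not something the sketch can establish.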


