Parametric Representation for Singing Voice Synthesis: a Comparative Evaluation

06/07/2020
by Onur Babacan, et al.

Various parametric representations have been proposed to model the speech signal. While the performance of such vocoders is well known in the context of speech processing, their extrapolation to singing voice synthesis might not be straightforward. The goal of this paper is twofold. First, a comparative subjective evaluation is performed across four existing techniques suitable for statistical parametric synthesis: the traditional pulse vocoder, the Deterministic plus Stochastic Model, the Harmonic plus Noise Model, and GlottHMM. The behavior of these techniques is studied as a function of the singer type (baritone, counter-tenor, and soprano). Second, the artifacts occurring in high-pitched voices are discussed, and possible approaches to overcome them are suggested.


