On the Use of a Spectral Glottal Model for the Source-filter Separation of Speech

12/21/2017
by   Olivier Perrotin, et al.
0

The estimation of glottal flow from a speech waveform is a key method for speech analysis and parameterization. Significant research effort has been made to dissociate the first vocal tract resonance from the glottal formant (the low-frequency resonance describing the open-phase of the vocal fold vibration). However few methods cope with estimation of high-frequency spectral tilt to describe the return-phase of the vocal fold vibration, which is crucial to the perception of vocal effort. This paper proposes an improved version of the well-known Iterative Adaptive Inverse Filtering (IAIF) called GFM-IAIF. GFM-IAIF includes a full spectral model of the glottis that incorporates both glottal formant and spectral tilt features. Comparisons with the standard IAIF method show that while GFM-IAIF maintains good performance on vocal tract removal, it significantly improves the perceptive timbral variations associated to vocal effort.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/20/2022

Vocal effort modeling in neural TTS for improving the intelligibility of synthetic speech in noise

We present a neural text-to-speech (TTS) method that models natural voca...
research
12/28/2019

A Comparative Study of Glottal Source Estimation Techniques

Source-tract decomposition (or glottal flow estimation) is one of the ba...
research
05/24/2020

Glottal source estimation robustness: A comparison of sensitivity of voice source estimation techniques

This paper addresses the problem of estimating the voice source directly...
research
12/29/2019

Complex Cepstrum-based Decomposition of Speech for Glottal Source Estimation

Homomorphic analysis is a well-known method for the separation of non-li...
research
03/31/2022

SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping

Neural vocoder using denoising diffusion probabilistic model (DDPM) has ...
research
02/17/2020

Lifter Training and Sub-band Modeling for Computationally Efficient and High-Quality Voice Conversion Using Spectral Differentials

In this paper, we propose computationally efficient and high-quality met...

Please sign up or login with your details

Forgot password? Click here to reset