Glottal Source Processing: from Analysis to Applications

12/29/2019
by   Thomas Drugman, et al.
0

The great majority of current voice technology applications relies on acoustic features characterizing the vocal tract response, such as the widely used MFCC of LPC parameters. Nonetheless, the airflow passing through the vocal folds, and called glottal flow, is expected to exhibit a relevant complementarity. Unfortunately, glottal analysis from speech recordings requires specific and more complex processing operations, which explains why it has been generally avoided. This review gives a general overview of techniques which have been designed for glottal source processing. Starting from fundamental analysis tools of pitch tracking, glottal closure instant detection, glottal flow estimation and modelling, this paper then highlights how these solutions can be properly integrated within various voice technology applications.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/31/2020

Data-driven Detection and Analysis of the Patterns of Creaky Voice

This paper investigates the temporal excitation patterns of creaky voice...
research
05/10/2020

Chirp Complex Cepstrum-based Decomposition for Asynchronous Glottal Analysis

It was recently shown that complex cepstrum can be effectively used for ...
research
11/25/2018

Glottal Closure Instants Detection From Pathological Acoustic Speech Signal Using Deep Learning

In this paper, we propose a classification based glottal closure instant...
research
12/30/2019

Causal-Anticausal Decomposition of Speech using Complex Cepstrum for Glottal Source Estimation

Complex cepstrum is known in the literature for linearly separating caus...
research
01/02/2020

Phase-based Information for Voice Pathology Detection

In most current approaches of speech processing, information is extracte...
research
07/19/2022

Machine-learning applied to classify flow-induced sound parameters from simulated human voice

Disorders of voice production have severe effects on the quality of life...
research
03/26/2019

WGANSing: A Multi-Voice Singing Voice Synthesizer Based on the Wasserstein-GAN

We present a deep neural network based singing voice synthesizer, inspir...

Please sign up or login with your details

Forgot password? Click here to reset